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Abstract. For any n > and k > 1, the density Hales-Jewett number c nt k is defined 
as the size of the largest subset of the cube [k] n := {1, ...,fc}" which contains no 
combinatorial line; similarly, the Moser number d n k is the largest subset of the cube 
[k] n which contains no geometric line. A deep theorem of Furstenberg and Katznelson 
[T2] . [H)] shows that c n ,k — o(k n ) as n — > oo (which implies a similar claim for 
d n k ); this is already non-trivial for k — 3. Several new proofs of this result have also 
been recently established [23], [2]. 

Using both human and computer-assisted arguments, we compute several values 
of Cn.,fc and c' n k for small n, k. For instance the sequence c n ,3 for n = 0, . . . , 6 is 
1, 2, 6, 18, 52, 150, 450, while the sequence d n<3 for n = 0, . . . , 6 is 1, 2, 6, 16, 43, 124, 353. 
We also prove some results for higher k, showing for instance that an analogue of the 
LYM inequality (which relates to the k = 2 case) does not hold for higher k, and also 
establishing the asymptotic lower bound c n< k > k n exp (— O^logn)) where £ is the 
largest integer such that 2k > 2 e . 



1. Introduction 



For any integers k > 1 and n > 0, let [k] '■— {1, . . . ,k}, and define [k] n to be the cube of 
words of length n with alphabet in [k). Thus for instance [3] 2 = {11, 12,13,21,22,23,31,32,33}. 

We define a combinatorial line in [k] n to be a set of the form {w(i) : % — 1, . . . , k} C 
[A;]™, where u> G ([fc] U {x}) n \[fc] n is a word of length n with alphabet in [fc] together 
with a "wildcard" letter x which appears at least once, and w(i) G [k] n is the word 
obtained from w by replacing x by i; we often abuse notation and identify w with the 
combinatorial line {w(i) : % = 1, . . . , k} it generates. Thus for instance, in [3] 2 we have 
x2 = {12,22,32} and xx = {11,22,33} as typical examples of combinatorial lines. In 
general, [k] n has k n words and (k + 1)™ — k n lines. 

A set A C [k] n is said to be line-free if it contains no combinatorial lines. Define the 
(n,k) density Hales-Jewett number c n ^ to be the maximum cardinality \A\ of a line- 
free subset of [k] n . Clearly, one has the trivial bound c n ^ < k n . A deep theorem of 
Furstenberg and Katznelson [TT], [T2] asserts that this bound can be asymptotically 
improved: 

Theorem 1.1 (Density Hales-Jewett theorem). For any fixed k > 2, one has lim^oo c ni k/k n = 
0. 
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Figure 1. Combinatorial lines in [3] 2 . 

Remark 1.2. The difficulty of this theorem increases with k. For k — 1, one clearly 
has c nj i = 1. For k = 2, a classical theorem of Sperner (28] asserts, in our language, 
that c n> 2 = (|„/2j)- The case = 3 is already non-trivial (for instance, it implies Roth's 
theorem [26] on arithmetic progressions of length three) and was first established in [11] 
(see also [12] )• The case of general k was first established in [12] and has a number of 
implications, in particular implying Szemeredi's theorem |29j on arithmetic progressions 
of arbitrary length. 



The Furstenberg-Katznelson proof of Theorem 1.1 relies on ergodic-theory techniques 
and does not give an explicit decay rate for c n ^. Recently, two further proofs of this 
theorem have appeared, by Austin [2] and by the sister Polymath project to this one [23J. 
The proof of [2] also uses ergodic theory, but the proof in [23] is combinatorial and gave 
effective bounds for c n ^ in the limit n — » oo. For example, if n can be written as an 
exponential tower 2|2t2|...|2 with m 2s, then c n ^ <C 3 n m -1 / 2 . However, 
these bounds are not believed to be sharp, and in any case are only non-trivial in the 
asymptotic regime when n is sufficiently large depending on k. 

Our first result is the following asymptotic lower bound. The construction is based 
on the recent refinements [9] [HI EJ] of a well-known construction of Behrend [3] and 



Rankin [25J. The proof of Theorem 1.3 is in Section v2A Let rk(n) be the maximum size 



of a subset of [n] that does not contain a /c-term arithmetic progression. 

Theorem 1.3 (Asymptotic lower bound for c n ^)- For each k > 3, there is an absolute 
constant C > such that 

fc-i 



c n ,k > Ck n ( ! ^=S) = k n exp (-0(^ 



n. 



where i is the largest integer satisfying 2k > 2 e . More specifically, 

c nk > C k n ~ a ^ "^ log n+ ^( fc ) lo § lo s n 

where all logarithms are base-k, and a(k) = (log2) 1_1/ ' £ £2^ 1 ^ 2_1 ^ and (3(k) = (k — 
l)/(2£). 
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Figure 2. Geometric lines in [3] 2 . 

In the case of small n, we focus primarily on the first non-trivial case k = 3. We have 
computed the following explicit values of c„ )3 (entered in the OEIS |2T] as A156762): 

Theorem 1.4 (Explicit values of c„ j3 ). We have c 0j3 = 1, Ci j3 = 2, c 2j3 = 6 ; c 3j3 = 18, 
c 43 = 52, C53 = 150, and c 63 = 450. 



This result is established in Sections |2J |3j Initially these results were established by an 
integer program, but we provide completely computer-free proofs here. The construc- 
tions used in Section [2] give reasonably efficient constructions for larger values of n; for 
instance, they show that 3" < cioo,3 < 2 x 3". See Section for further discussion. 



A variant of the density Hales- Jewett theorem has also been studied in the literature. 
Define a geometric line in [k] n to be any set of the form {a + ir : i — 1, . . . , A;} in [k] n , 
where we identify [k] n with a subset of Z n , and a,r 6 Z n with r ^ 0. Equivalently, a 
geometric line takes the form {w(i, k+l—i) : i — 1, . . . , k}, where w G ([/c]U{x, x}) n \[A;] ?1 
is a word of length n using the numbers in [k] and two wildcards x,x as the alphabet, 
with at least one wildcard occurring in w, and w(i,j) 6 [k] n is the word formed by 
substituting i, j for x,x respectively. Figure [2] shows the eight geometric lines in [3] 2 . 
Clearly every combinatorial line is a geometric line, but not conversely. In general, [k] n 
has ((k + 2) n — k n )/2 geometric lines. 

Define a Moser set in [k] n to be a subset of [k] n that contains no geometric lines, and let 
c' nk be the maximum cardinality \A\ of a Moser set in [k] n . Clearly one has c' nk < c n ^, 
so in particular from Theorem 1.1 one has c' n k /k n — > as n — > 00. (Interestingly, there 
is no known proof of this fact that does not go through Theorem |1.1| even for k — 3.) 
Again, k = 3 is the first non-trivial case: it is clear that c' nl = and c' n 2 = 1 for all n. 

The question of computing c' n 3 was first posed by Moser [TH] . Prior to our work, the 
values 

c o,3 = 1; c i,3 = 2; c 2 3 = 6; c 3 3 = 16; c 4 3 = 43 
were known [8], [6] (this is Sequence A003142 in the OEIS [21J). We extend this sequence 
slightly: 
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Theorem 1.5 (Values of c' n3 for small n). We have c' 03 = 1, d iz = 2, c' 23 = 6, c 33 = 16, 
C4 3 = 43, Cg 3 = 124, and c' 63 = 353. 

This result is established in Sections |4j [5j The arguments given here are computer- 
assisted; however, we have found alternate (but lengthier) computer-free proofs for the 
above claims with the the exception of the proof of c 6 3 = 353, which requires one non- 



trivial computation (Lemma 5.13). These alternate proofs are not given in this paper 



to save space, but can be found at [2 

We establish a lower bound for this problem of (2+o(l)) ( n J2 l < c' n 3 , which is maximized 
for i near 2n/3. This bound is around one-third better than the previous literature [18J, 
[TJ. We also give methods to improve on this construction. 

Earlier lower bounds were known. Indeed, let A(n, d) denote the size of the largest 
binary code of length n and minimal distance d. Then 



2' 3 > max f ( . ) 



A{n-j,k-j + l) . (1.1) 



which, with A(n, 1) = 2 n and A(n, 2) = 2 n , implies in particular that 



71 + 1 



<s> ^^jJS^J- 1 (1.2) 

for n > 2. This bound is not quite optimal; for instance, it gives a lower bound of 
4,3 > 344. 

Remark 1.6. Let c" 3 be the size of the largest subset of F 3 which contains no lines 
x, x + r, x + 2r with x, r G F 3 and r 7^ 0, where F3 is the field of three elements. Clearly 
one has c" 3 < c' n 3 < c n $. It is known that 

c o,3 = lj c i,3 = 2; c 2 3 = 4; c 3)3 = 9; c 4 3 = 20; c 5 3 = 45; c 6 3 = 112; 

see [22]. 

As mentioned earlier, the sharp bound on c n ^ comes from Sperner's theorem. It is 
known that Sperner's theorem can be refined to the Lubell-Yamamoto-Meshalkin (LYM) 
inequality, which in our language asserts that 

\A n r ai)a2 1 



E 



ir 1 



< 1 



ai ,a2>0;ai+a2=rt 

for any line-free subset A C [2] n , where the cell T air ,_ }ak C [/c] n is the set of words in 
[k] n which contain exactly aj i's for each % — 1, . . . , k. It is natural to ask whether this 
inequality can be extended to higher k. Let A n fc denote the set of all tuples (a 1; . . . , a^) 
of non-negative integers summing to n, define a simplex to be a set of k points in A n ^ of 
the form (ai+r, a 2 , . . . , a k ), (ai, a 2 +r, . . . , a k ), . . . , (a 1; a 2 , . . . , a k +r) for some < r < n 
and ai, . . . , 0^ summing to n — r, and define a Fujimura se^to be a subset i? C A„ fc 

1 Fujimura actually proposed the related problem of finding the largest subset of A n that contained 
no equilateral triangles; see |10j . Our results for Fujimura sets can be found at the page Fujimura 's 
problem at [24"] . 
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Figure 3. A Fujimura set in A 6>3 , displayed in "rectangular" coordi- 
nates. The point (a, b, c) is represented by a square at (a, b) labeled with 
c. The Fujimura set is shown in red; its complement in Ag,3 is shown in 
gray 
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Figure 4. A Fujimura set in At^, expressed in "triangular" coordinates, 
which contains no simplices. Observe that if w is a combinatorial line in [k] n , then 

tf(l) £ T ai+r,a2,...,a,f {: j ^\~) £ ^ a\,a2+r,...,ak i • • • j W\k) £ IT ai,a,2,...,a,k+r 

for some simplex (ai + r, a 2 , . . . , a fc ), (ai, a 2 + r, . . . , a fc ), . . . , (ai, a 2 , . . . , a fe + r). Thus, if 
-B is a Fujimura set, then A := |J 5eB is line-free. Note also that 

yr |Anr g | = 

This motivates a "hyper-optimistic" conjecture: 

Conjecture 1.7. For any > 1 and n > 0, and any line-free subset A of [k] n , one has 

/ , i r I — L n,fc> 

-r- A S 
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where c p nk is the maximal size of a Fujimura set in A n .&. 



One can show that this conjecture for a fixed value of k would imply Theorem for 



the same value of k, in much the same way that the LYM inequality is known to imply 



Sperner's theorem. The LYM inequality asserts that Conjecture |1.7| is true for k < 2. 
As far as we know, this conjecture could hold in k = 3. However, we found a simple 
counterexample for k — 4 and n = 2, given by the line-free set 

A := {(1, 1), (1, 2), (1, 3), (2, 1), (2, 3), (2, 4), (3, 2), (3, 3), (3, 4), (4, 1), (4, 2), (4, 4)} 

together with the computation that c% 2 — 7 . It is in fact likely that this conjecture fails 
for all higher k also. 



1.1. Notation. There are several subsets of [k] n which will be useful in our analysis. 
We have already introduced combinatorial lines, geometric lines, and cells. One can 
generalise the notion of a combinatorial line to that of a combinatorial subspace in [k] n 
of dimension d, which is indexed by a word w in ([k] U {x±, . . . , Xd}) n containing at least 
one of each wildcard x\, . . . , xj, and which forms the set {w(h, . . . , id) : i\, . . . , id € [k]}, 
where w(i\, . . . ,id) 6 [k] d is the word formed by replacing x\, . . . , Xd with i\, . . . , id 
respectively. Thus for instance, in [3] 3 , we have the two-dimensional combinatorial sub- 
space xxy = {111, 112, 113, 221, 222, 223, 331, 332, 333}. We similarly have the notion of 
a geometric subspace in [k] n of dimension d, which is defined similarly but with d wild- 
cards xi, . . . , Xd, x~i, . . . , x~d, with at least one of either Xi or xl appearing in the word w 
for each 1 < i < d, and the space taking the form {w(ii, ■ ■ ■ ,id, k+l—ii, . . . , k + 1 — i d ) : 
ii, . . . , id G [k]}. Thus for instance [3] 3 contains the two-dimensional geometric subspace 
xxy = {131, 132, 133, 221, 222, 223, 311, 312, 313}. 

An important class of combinatorial subspaces in [k] n will be the slices consisting of n — 1 
distinct wildcards and one fixed coordinate. We will denote the distinct wildcards here 
by asterisks, thus for instance in [3] 3 we have 2** = {211, 212, 213, 221, 222, 223, 231, 232, 233}. 
Two slices are parallel if their fixed coordinate are in the same position, thus for instance 
1 * * and 2 * * are parallel, and one can subdivide [k] n into k parallel slices, each of 
which is isomorphic to [A;] n_1 . In the analysis of Moser slices with k = 3, we will make a 
distinction between centre slices, whose fixed coordinate is equal to 2, and side slices, in 
which the fixed coordinate is either 1 or 3, thus [3] n can be partitioned into one centre 
slice and two side slices. 

Another important set in the study of k — 3 Moser sets are the spheres 5*j. n C [3] n , 
defined as those words in [3] n with exactly n — i 2's (and hence i letters that are 1 or 3). 
Thus for instance S lj3 = {122,322,212,232,221,223}. Observe that [3] n = \J7=o S i,n, 
and each S^ n has cardinality \S^ n \ = (")2\ 

It is also convenient to subdivide each sphere S ijn into two components S ijU = S° n \JSf n , 
where S° n are the words in S^ n with an odd number of l's, and Sf n are the words with an 
even number of l's. Thus for instance = {122, 212, 221} and Sf >3 = {322, 232, 223}. 
Observe that for i > 0, S° n and Sf n both have cardinality (™)2 l_1 . 
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The Hamming distance between two words w, w' is the number of coordinates in which 
w,w' differ, e.g. the Hamming distance between 123 and 321 is two. Note that Si >n is 
nothing more than the set of words whose Hamming distance from 2 ... 2 is i, which 
justifies the terminology "sphere". 

In the density Hales- Jewett problem, there are two types of symmetries on [k] n which 
map combinatorial lines to combinatorial lines (and hence line-free sets to line-free 
sets). The first is a permutation of the alphabet [k]; the second is a permutation of 
the n coordinates. Together, this gives a symmetry group of order k\n\ on the cube 
[k] n , which we refer to as the combinatorial symmetry group of the cube [k] n . Two sets 
which are related by an element of this symmetry group will be called (combinatorially) 
equivalent, thus for instance any two slices are combinatorially equivalent. 

For the analysis of Moser sets in [k] n , the symmetries are a bit different. One can 
still permute the n coordinates, but one is no longer free to permute the alphabet [k]. 
Instead, one can reflect an individual coordinate, for instance sending each word x\ . . . x n 
to its reflection x\ . . . X{-i(k + 1 — Xi)Xi+\ . . . x n . Together, this gives a symmetry group 
of order 2 n n\ on the cube [k] n , which we refer to as the geometric symmetry group of 
the cube [k] n ; this group maps geometric lines to geometric lines, and thus maps Moser 
sets to Moser sets. Two Moser sets which are related by an element of this symmetry 
group will be called (geometrically) equivalent. For instance, a sphere Si >n is equivalent 
only to itself, and S° n , Sf n are equivalent only to each other. 



1.2. About this project. This paper is part of the Polymath project, which was 
launched by Timothy Gowers in February 2009 as an experiment to see if research 
mathematics could be conducted by a massive online collaboration. The first project in 
this series, Polymathl, was focused on understanding the density Hales- Jewett numbers 
c nt k, and was split up into two sub-projects, namely an (ultimately successful) attack 
on the density Hales- Jewett theorem c n ^ = o{k n ) (resulting in the paper [23J), and a 
collaborative project on computing c n ^ and related quantities (such as c' n k ) for various 
small values of n and k. This project (which was administered by Terence Tao) resulted 
in this current paper. 

Being such a collaborative project, many independent aspects of the problem were 
studied, with varying degrees of success. For reasons of space (and also due to the partial 
nature of some of the results), this paper does not encompass the entire collection of 
observations and achievements made during the research phase of the project (which 
lasted for approximately three months). In particular, alternate proofs of some of the 
results here have been omitted, as well as some auxiliary results on related numbers, 
such as coloring Hales- Jewett numbers. However, these results can be accessed from the 
web site of this project at [21]. We are indebted to Michael Nielsen for hosting this web 
site, which performed a crucial role in the project. A list of contributors to the project 
(and the grants that supported these individuals) can also be found at this site. 
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2. Lower bounds for the density Hales-Jewett problem 
The purpose of this section is to establish various lower bounds for c„ i3 , in particular 



establishing Theorem 1.3 and the lower bound component of Theorem 1.4 



As observed in the introduction, if B C A n)3 is a Fujimura set (i.e. a subset of A„ )3 = 
{(a, 6, c) G N 3 : a + b + c = n} which contains no upward equilateral triangles (a + 
r, b, c), (a, b + r, c), (a, 6, c + r)), then the set A B '■— Uaes ^a,b,c is a line-free subset of 
[3] n , which gives the lower bound 

<^>i^i = £ ^ ("> 

(a,b,c)eB 

All of the lower bounds for c ni3 in this paper will be constructed via this device. (Indeed, 



one may conjecture that for every n there exists a Fujimura set B for which (2.1) is 
attained with equality; we know of no counterexamples to this conjecture.) 



In order to use (2.1), one of course needs to build Fujimura sets B which are "large" 



in the sense that the right-hand side of (2.1) is large. A fruitful starting point for this 
goal is the sets 

B jt n := {(a, b, c) G A n>3 : a + 2b j mod 3} 

for j = 0, 1, 2. Observe that in order for a triangle (a + r, b, c), (a, b + r, c), (a, b,c + r) 
to lie in the length r of the triangle must be a multiple of 3. This already makes 
Bj >n a Fujimura set for n < 3 (and B 0>n a Fujimura set for n = 3). 

When n is not a multiple of 3, the B^ n are all rotations of each other and give equivalent 
sets (of size 2 x 3™" 1 ). When n is a multiple of 3, the sets B\ >n and £?2,n are reflections 
of each other, but Bq }11 is not equivalent to the other two sets (in particular, it omits all 
three corners of A n 3 ); the associated set A Bo n is slightly larger than A Bl n and A B2 n 
and thus is slightly better for constructing line-free sets. 

As mentioned already, B n is a Fujimura set for n < 3, and hence A Bon is line-free for 



n < 3. Applying (2.1) one obtains the lower bounds 

c ,3 > 1; Ci, 3 > 2; c 2 , 3 > 6; c 3j3 > 18. 

For n > 3, B , n contains some triangles (a + r, 6, c), (a, 6 + r, c), (a, 6, c + r) and so is 
not a Fujimura set, but one can remove points from this set to recover the Fujimura 
property. For instance, for n < 6, the only triangles in i?o,n have side length r = 3. One 
can "delete" these triangles by removing one vertex from each; in order to optimise the 



bound (2.1) it is preferable to delete vertices near the corners of A n 3 rather than near 
the centre. These considerations lead to the Fujimura sets 

fl„,4\{(0,0, 4), (0,4,0), (4, 0,0)} 

^{(0,4,1), (0,5,0), (4, 0,1), (5, 0,0)} 

£o, 6 \{(0, 1, 5), (0, 5, 1), (1, 0, 5), (0, 1, 5), (1, 5, 0), (5, 1, 0)} 
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Figure 5. Lower bounds for c n obtained by the Ag construction. 



which by (2.1) gives the lower bounds 

c 4 ,3 > 52; c 5 ,3 > 150; c 6 , 3 > 450. 
Thus we have established all the lower bounds needed for Theorem 11.41 

One can of course continue this process by hand, for instance the set 

B 0jr \{(0, 1, 6), (1, 0, 6), (0, 5, 2), (5, 0, 2), (1, 5, 1), (5, 1, 1), (1, 6, 0), (6, 1, 0)} 

gives the lower bound C7.3 > 1302, which we tentatively conjecture to be the correct 
bound. 

A simplification was found when n is a multiple of 3. Observe that for n = 6, the sets 
excluded from B Q 6 are all permutations of (0, 1, 5). So the remaining sets are all the 
permutations of (1, 2, 3) and (0, 2, 4). In the same way, sets for n = 9, 12 and 15 can be 
described as: 

• n = 9: (2, 3, 4), (1, 3, 5), (0, 4, 5) and permutations; 

• n = 12: (3, 4, 5), (2, 4, 6), (1, 5, 6), (0, 2, 10), (0, 5, 7) and permutations; 

. n = 15: (4, 5, 6), (3, 5, 7), (2, 6, 7), (1, 3, 11), (1, 6, 8), (0, 4, 11), (0, 7, 8) and per- 
mutations. 



When n is not a multiple of 3, say n = 3m — 1 or n = 3m — 2, one first finds a solution 
for n = 3m. Then for n = 3m — 1, one restricts the first digit of the 3m sequence 
to equal 1. This leaves exactly one-third as many points for 3m — 1 as for 3m. For 
n = 3m — 1, one restricts the first two digits of the 3m sequence to be 12. This leaves 
roughly one-ninth as many points for 3m — 2 as for 3m. 



An integer prograrqjwas solved to obtain the maximum lower bound one could establish 
from (2.1). The results for 1 < n < 20 are displayed in Figure [5j More complete data, 
including the list of optimisers, can be found at [T7] . 



2 Details of the integer programming used in this paper can be found at the page Integer.tex at 
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For medium values of n, in particular for integers 21 < n < 999 that are a multiple 



of 3, n — 3m, the best general lower bound for c nj3 was found by applying (2.1) to 
the following Fujimura set construction. It is convenient to write [a, b, c] for the point 
(m + a, m + b,m + c), together with its permutations, with the convention that [a, b, c] 
is empty if these points do not lie in A n 3 . Then a Fujimura set can be constructed by 
taking the following groups of points: 

(1) The thirteen groups of points 

[-7, -3, +10], [-7, 0, +7], [-7, +3, +4], [-6, -4, +10], [-6, -1, +7], [-6, +2, +4] 

[-5, -1, +6], [-5, +2, +3], [-4, -2, +6], [-4, +1, +3], [-3, +1, +2], [-2, 0, +2], [-1, 0, +1]; 

(2) The four families of groups 

[-8 - y - 2x, -6 + y - 2x, 14 + Ax], [-8 - y - 2x, -3 + y - 2x, 11 + Ax], 

[—8 — y — 2x, x + y, 8 + x], [—8 — 2x, 3 + x, 5 + x] 
for x > and y — 0, 1. 

Numerical computation shows that this construction gives a line-free set in [3] n of density 



approximately 2.7 y for n < 1000; for instance, when n = 99, it gives a line- free set 

of density at least 1/3. Some additional constructions of this type can be found at the 
page Upper and lower bounds at 



However, the bounds in Theorem 1.3, which we now prove, are asymptotically superior 
to these constructions. 



Proof of Theorem L3_, Let M be the circulant matrix with first row (1, 2, . . . , k — 1), 
second row (A; — 1, 1, 2, . . . , k — 2), and so on. Note that M has nonzero determinant by 
well-known propertied of circulant matrices, see e.g. [TBI Theorem 3]. 



Let S be a subset of the interval [— \/n/2, y/n/2) that contains no nonconstant arithmetic 
progressions of length k, and let B C A njfc be the set 



k-i 



B := {(n — ^2 a h «i) «2, • • • , a k -i) ■ (a-i, • • • , a k -i) = c + det(M)M 1 s, s E S k 1 }, 



where c is the — 1-dimensional vector, all of whose entries are equal to \n/k\. The 
map (m, a\, . . . , a^i) ^ M(ai, . . . , a^-i) takes simplices in A n ^ to nonconstant arith- 
metic progressions in and takes B to {Mc + det(M) s: s G S^ -1 }, which is a set 
containing no nonconstant arithmetic progressions. Thus, B is a Fujimura set and so 
does not contain any combinatorial lines. 



3 For instance, if we let Ai denote the i th row, we see that (Ai — A 2 ) + (+i+i — Ai) is of the form 
(0, . . . , 0, —k + 1, 0, . . . , 0, k — 1), and so the row space spans all the vectors whose coordinates sum to 
zero; but the first row has a non-zero coordinate sum, so the rows in fact span the whole space. 
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If all of d, . . . , cifc are within C\^fn of n/k, then \Ts\ > Ck n /n^ k ~ 1 '' 2 (where C depends 
on Ci) by the central limit theorem. By our choice of S and applying (2.1) (or more 
precisely, the obvious generalisation of (2.1) to other values of k), we obtain 

Cnk > Ck n /n^ k - 1)/2 \S\ k - 1 = Ck r - 



\S\ 



One can take S to have cardinality r&(\/n), which from the results of O'Bryant [20J 
satisfies (for all sufficiently large n, some C > 0, and I the largest integer satisfying 
/,• -2' ') 

r k (y/n) 



n 



> C(logn) 1 /^) exp 2 (- 



which completes the proof. 



□ 



3. Upper bounds for the k = 3 density Hales- Jewett problem 



To finish the proof of Theorem 1.4 we need to supply the indicated upper bounds for 
c nj3 for n = 0, . . . , 6. 

It is clear that co,3 = 1 and = 2. By subdividing a line-free set into three parallel 
slices we obtain the bound 

C n +1,3 < 3c Hi 3 

for all n. This is already enough to get the correct upper bounds c 2j 3 < 6 and c 3j3 < 18, 
and also allows us to deduce the upper bound c 6>3 < 450 from c 5)3 < 150. So the 
remaining tasks are to establish the upper bounds 

c 4 ,3 < 52 (3.1) 

and 

c 5 ,3 < 150. (3.2) 



3 



In order to establish (3.2), we will rely on (3.1), together with a classification of those 
line-free sets in [3] 4 of size close to the maximal number 52. Similarly, to establish (3.1 ) 
we will need the bound c 3)3 < 18, together with a classification of those line-sets in [3] 
of size close to the maximal number 18. Finally, to achieve the latter aim one needs to 
classify the line- free subsets of [3] 2 with exactly c 2j 3 = 6 elements. 

3.1. n — 2. We begin with the n = 2 theory. 

Lemma 3.1 (n = 2 extremals). There are exactly four line-free subsets of [3] 2 of car- 
dinality 6: 

• The set x := A B2 2 = {12, 13, 21, 22, 31, 33}; 

• The set y := A B21 = {11, 12, 21, 23, 32, 33}; 

• The set z := A B20 = {11, 13, 22, 23, 31, 32}; 

• The set w := {12, 13, 21, 23, 31, 32}. 
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Proof. A line-free subset of [3] 2 must have exactly two elements in every row and column. 
The claim then follows by brute force search. □ 

3.2. n — 3. Now we turn to the n = 3 theory. We can slice [3] 3 as the union of three 
slices 1**, 2**, 3**, each of which are identified with [3] 2 in the obvious manner. 
Thus every subset A in [3] 3 can be viewed as three subsets Ai,A 2 ,A 3 of [3] 2 stacked 
together; if A is line-free then A\,A 2 ,A^ are necessarily line-free, but the converse is 
not true. We write A = A\A 2 A^, thus for instance xyz is the set 

xyz = {112, 113, 121, 122, 131, 133}U{211, 212, 221, 223, 232, 233}U{311, 313, 322, 323, 331, 332}. 
Observe that 

A b 0>3 = xyz; A Bl3 = yzx; A B%3 = zxy. 

Lemma 3.2 in = 3 extremals). The only 18-element line-free subset of[3} 3 is xyz. The 
only 17 '-element line-free subsets of [3] 3 are formed by removing a point from xyz, or by 
removing either 111, 222, or 333 from yzx or zxy. 

Proof. We prove the second claim. As 17 = 6 + 6 + 5, and 02,3 = 6, at least two of the 
slices of a 17-element line-free set must be from x, y, z, w, with the third slice having 
5 points. If two of the slices are identical, the last slice must lie in the complement and 
thus has at most 3 points, a contradiction. If one of the slices is a w, then the 5-point 
slice consists of the complement of the other two slices and thus contains a diagonal, 
contradiction. By symmetry we may now assume that two of the slices are x and y, 
which force the last slice to be z with one point removed. Now one sees that the slices 
must be in the order xyz, yzx, or zxy, because any other combination has too many 
lines that need to be removed. The sets yzx, zxy contain the diagonal {111,222,333} 
and so one additional point needs to be removed. 

The first claim follows by a similar argument to the second. □ 

3.3. n = 4. Now we turn to the n = 4 theory. 
Lemma 3.3. c 4 3 < 52. 

Proof. Let A be a line-free set in [3] 4 , and split A = A1A2A3 for A\, A2, A3 e [3] 3 as in 
the n = 3 theory. If at least two of the slices A\,A 2 ,A^ are of cardinality 18, then by 



Lemma 3.2 they are of the form xyz, and so the third slice then lies in the complement 
and has at most six points, leading to an inferior bound of 18 + 18+6 = 42. Thus at most 
one of the slices can have cardinality 18, leading to the bound 18 + 17 + 17 = 52. □ 



Now we classify extremisers. Observe that we have the following (equivalent) 52-point 
line-free sets, which were implicitly constructed in the previous section; 

• E :=A Bo4 \{im,2222}; 

• E l := A Bl4 \{2222,3333}; 

• E 2 := A B24 \{1111,3333}. 
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Lemma 3.4. 



• The only 52-element line-free sets in [3] 4 are E , E\, E 2 . 

• The only 51-element line-free sets in [3] 4 are formed by removing a point from 
E Q , E x or E 2 . 

• The only 50-element line-free sets in [3] 4 are formed by removing two points 
from Eq, Ei or E 2 OR are equal to one of the three permutations of the set 

X := ^1,0 U 1*3,0,1 U ^,2,0 U r2,0,2 U L 1,1,2 U Ti^,! u ro,2,2- 



Proof. We will just prove the third claim, which is the hardest; the first two claims 
follow from the same argument (and can in fact be deduced directly from the third 
claim). 

It suffices to show that every 50-point line-free set is either contained in the 54-point set 
A B . 4 for some j = 0, 1, 2, or is some permutation of the set X. Indeed, if a 50-point line- 
free set is contained in, say, A Boi , then it cannot contain 2222, since otherwise it must 
omit one point from each of the four pairs formed from {2333, 2111} by permuting the 
indices, and must also omit one of {1111, 1222, 1333}, leading to at most 49 points in all; 
similarly, it cannot contain 1111, and so omits the entire diagonal {1111,2222,3333}, 
with two more points to be omitted. By symmetry we see the same argument works 
when A Bo4 is replaced by one of the other A Bj4 . 



Next, observe that every three-dimensional slice of a line-free set can have at most 
C3,3 = 18 points; thus when one partitions a 50-point line-free set into three such slices, 
it must divide either as 18 + 16 + 16, 18 + 17 + 15, 17 + 17 + 16, or some permutation of 
these. Suppose that we can slice the set into two slices of 17 points and one slice of 16 
points. By the various symmetries, we may assume that the 1 * ** slice and 2 * ** slices 
have 17 points, and the 3*** slice has 16 points. By Lemma 3.2, the 1-slice is {1} x D 3 j 
with one point removed, and the 2-slice is {2} x _D 3fc with one point removed, for some 
j, k e {0, 1, 2}. If j = k, then the 1-slice and 2-slice have at least 15 points in common, 
so the 3-slice can have at most 27 — 15 = 12 points, a contradiction. If jk = 01, 12, 
or 20, then observe that from Lemma 3J2 the *1 * *, *2 * *, *3 * * slices cannot equal 
a 17-point or 18-point line-free set, so each have at most 16 points, leading to only 48 
points in all, a contradiction. Thus we must have jk = 10, 21, or 02. 



First suppose that jk = 02. Then by Lemma 3.2, the 2 * ** slice contains the nine 



points formed from {2211, 2322, 2331} and permuting the last three indices, while the 
1 * ** slice contains at least eight of the nine points formed from {1211, 1322, 1311} and 
permuting the last three indices. Thus the 3 * ** slice can contain at most one of the 
nine points formed from {3211,3322,3311} and permuting the last three indices. If it 
does contain one of these points, say 3211, then it must omit one point from each of the 
four pairs {3222, 3233}, {3212, 3213}, {3221, 3231}, {3111, 3311}, leading to at most 15 
points on this slice, a contradiction. So the 3 * ** slice must omit all nine points, and is 
therefore contained in {3} x 1)3,1, and so the 50-point set is contained in -D^i, and we 
are done by the discussion at the beginning of the proof. 
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The case jk = 10 is similar to the jk = 02 case (indeed one can get from one case to 
the other by swapping the 1 and 2 indices). Now suppose instead that jk = 12. Then 



by Lemma 3.2, the 1 * ** slice contains the six points from permuting the last three 
indices of 1123, and similarly the 2 * ** slice contains the six points from permuting 
the last three indices of 2123. Thus the 3 * ** slice must avoid all six points formed by 
permuting the last three indices of 3123. Similarly, as 1133 lies in the 1 * ** slice and 
2233 lies in the 2 * ** slice, 3333 must be avoided in the 3 * ** slice. 



Now we claim that 3111 must be avoided also; for if 3111 was in the set, then one point 
from each of the six pairs formed from {3311,3211}, {3331,3221} and permuting the 
last three indices must lie outside the 3 * ** slice, which reduces the size of that slice to 
at most 27 — 6 — 1 — 6 = 14, which is too small. Similarly, 3222 must be avoided, which 
puts the 3 * ** slice inside {3} x D 3 and then places the 50-point set inside _D 4 , and we 
are done by the discussion at the beginning of the proof. 



We have handled the case in which at least one of the slicings of the 50-point set is of 
the form 50 = 17+17+16. The only remaining case is when all slicings of the 50-point 
set are of the form 18 + 16 + 16 or 18 + 17+15 (or a permutation thereof). So each slicing 
includes an 18-point slice. By the symmetries of the situation, we may assume that the 
1 * ** slice has 18 points, and thus by Lemma [3~2 takes the form {1} x D 3 . Inspecting 
the *1 * *, *2 * *, *3 * * slices, we then see (from Lemma 3.2) that only the *1 * * 
slice can have 18 points; since we are assuming that this slicing is some permutation of 
18 + 17 + 15 or 18 + 16 + 16, we conclude that the *1 * * slice must have exactly 18 
points, and is thus described precisely by Lemma 3.2 Similarly for the * * 1* and * * *1 



slices. Indeed, by Lemma 3.2, we see that the 50-point set must agree exactly with D^ i 
on any of these slices. In particular, there are exactly six points of the 50-point set in 
the remaining portion {2, 3} 4 of the cube. 



Suppose that 3333 was in the set; then since all permutations of 3311, 3331 are known 
to lie in the set, then 3322, 3332 must lie outside the set. Also, as 1222 lies in the set, 
at least one of 2222, 3222 lie outside the set. This leaves only 5 points in {2, 3} 4 , a 
contradiction. Thus 3333 lies outside the set; similarly 2222 lies outside the set. 

Let a be the number of points in the 50-point set which are some permutation of 2233, 
thus < a < 6. Ifa = then the set lies in and we are done. If a = 6 then the 
set is exactly X and we are done. Now suppose a = 1. By symmetry we may assume 
that 2233 lies in the set. Then (since 2133, 1233, 2231, 2213 are known to lie in the set) 
2333, 3233, 2223, 2232 lie outside the set, which leaves at most 5 points inside {2, 3} 4 , a 
contradiction. A similar argument holds if a = 2,3. 

The remaining case is when a = 4,5. Then one of the three pairs {2233,3322}, 
{2323, 3232}, {2332, 3223} lie in the set. By symmetry we may assume that {2233, 3322} 
lie in the set. Then by arguing as before we see that all eight points formed by per- 
muting 2333 or 3222 lie outside the set, leading to at most 5 points inside {2, 3} 4 , a 
contradiction. □ 
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3.4. n — 5. Finally, we turn to the n = 5 theory. Our goal is to show that 05,3 < 150. 
Accordingly, suppose for contradiction that we can find a line-free subset A of [3] 5 of 
cardinality \A\ = 151. We will now prove a series of facts about A which will eventually 
give the desired contradiction. 

Lemma 3.5. A is not contained inside Abj 5 for any j = 0, 1, 2. 



Proof. Suppose for contradiction that A C Ab 5 for some j. By symmetry we may take 
j = 0. The set Ab 5 has 162 points. By looking at the triplets {10000, 11110, 12220} and 
cyclic permutations we must lose 5 points; similarly from the triplets {20000, 22220, 21110} 
and cyclic permutations. Finally from {11000, 11111, 11222} and {22000, 22222, 22111} 
we lose two more points. Since 162 — 5 — 5 — 2 = 150, we obtain the desired contradic- 
tion. □ 



Observe that every slice of A contains at most c 4i 3 = 52 points, and hence every slice 
of A contains at least 151 — 52 — 52 = 47 points. 

Lemma 3.6. A cannot have two parallel [3] 4 slices, each of which contain at least 51 
points. 



Proof. Suppose not that A has two parallel [3] 4 slices. By symmetry, we may assume 
that the 1 * * * * and 2 * * * * slices have at least 51 points. Meanwhile, the 3 * * * * 
slice has at least 47 points as discussed above. 



By Lemma 3.4, the 1 * * * * slice takes the form {1} x _D 4j - for some j = 0, 1,2 with 
the diagonal {11111, 12222, 13333} and possibly one more point removed, and similarly 
the 2 * * * * slice takes the form {2} x D^k for some k = 0,1,2 with the diagonal 
{21111,22222,23333} and possibly one more point removed. 

Suppose first that j = k. Then the 1-slice and 2-slice have at least 50 points in common, 
leaving at most 31 points for the 3-slice, a contradiction. Next, suppose that jk = 01. 
Then observe that the *i * ** slice cannot look like any of the configurations in Lemma 



3.4 and so must have at most 50 points for i = 1,2, 3, leading to 150 points in all, a 



contradiction. Similarly if jk = 12 or 20. Thus we must have jk equal to 10, 21, or 02. 

Let's suppose first that jk = 10. The first slice then is equal to {1} x _D 41 with 
the diagonal and possibly one more point removed, while the second slice is equal to 
{2} x _D 4j0 with the diagonal and possibly one more point removed. Superimposing 
these slices, we thus see that the third slice is contained in {3} x _D 4 2 except possibly 
for two additional points, together with the one point 32222 of the diagonal that lies 
outside of {3} x _D 4j2 . 

The lines xl2xx,xl3xx (plus permutations of the last four digits) must each contain 
one point outside the set. The first two slices can only absorb two of these, and so at 
least 14 of the 16 points formed by permuting the last four digits of 31233, 31333 must 
lie outside the set. These points all lie in {3} x _D 4 .2, and so the 3 * * * * slice can have 
at most I-D42I — 14 + 3 = 43 points, a contradiction. 
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The case jk = 02 is similar to the case jk = 10 (indeed one can obtain one from the 
other by swapping 1 and 2). Now we turn to the case jk = 21. Arguing as before we 
see that the third slice is contained in {3} x _D 4 except possibly for two points, together 
with 33333. 

If 33333 was in the set, then each of the lines xx333, xxx33 (and permutations of the 
last four digits) must have a point missing from the first two slices, which cannot be 
absorbed by the two points we are permitted to remove; thus 33333 is not in the set. 
For similar reasons, 33331 is not in the set, as can be seen by looking at xxx31 and 
permutations of the last four digits. Indeed, any string containing four threes does not 
lie in the set; this means that at least 8 points are missing from {3} x D^, leaving only 
at most 46 points inside that set. Furthermore, any point in the 3 * * * * slice outside of 
{3} x L> 4 can only be created by removing a point from the first two slices, so the total 
cardinality is at most 46 + 52 + 52 = 150, a contradiction. □ 

Remark 3.7. This already gives the bound < 52 + 50 + 50 = 152, but of course we 
wish to do better than this. 

Lemma 3.8. A has a slice j * * * * with j = 1, 2, 3 that has at most 49 points. 



Proof. Suppose not, thus all three slices of A has at least 50 points. Using earlier 
notation, we split subsets of [3] 4 into nine subsets of [3] 2 . So we think of x, y, z, a, b and 



c as subsets of a square. By Lemma 3.4, each slice is one of the following: 



• Eq = y'zx, zx'y, xyz (with one or two points removed) 

• Ei = xyz, yz'x, zxy' (with one or two points removed) 

• E2 — z'xy, xyz, yzx' (with one or two points removed) 

• X = xyz, ybw, zwc 

• Y = axw, xyz, wzc 

• Z = awx, wby, xyz 

where a, b and c have four points each: a = {2, 3} 2 , b = {1, 3} 2 and c = {1, 2} 2 . x', y' 
and z' are subsets of x, y and z respectively, and have five points each. 

Suppose all three slices are subsets of Ej 1 , Ej 2 , Ej 3 respectively for some j\,ji,j?> G 
{0, 1, 2}, Ei, or E2. We can remove at most five points from the full set Ej 1 l±l Ej 2 l±l Ej 3 . 
Consider columns 2, 3, 4, 6, 7, 8. At most two of these columns contain xyz, so one point 
must be removed from the other four. This uses up all but one of the removals. So 
the slices must be E 2 , Ei, E or a cyclic permutation of that. Then the cube, which 
contains the first square of slice 1; the fifth square of slice 2; and the ninth square of 
slice 3, contains three copies of the same square. It takes more than one point removed 
to remove all lines from that cube. So we can't have all three slices subsets of Ej. 

Suppose one slice is X , Y or Z, and two others are subsets of Ej. We can remove at 
most three points from the two Ej. By symmetry, suppose one slice is X. Consider 
columns 2, 3, 4 and 7. They must be cyclic permutations of x, y, z, and two of them 
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are not xyz, so must lose a point. Columns 6 and 8 must both lose a point, and we 
only have 150 points left. So if one slice is X, Y or Z, the full set contains a line. 

Suppose two slices are from X, Y and Z, and the other is a subset of Ej. By symmetry, 
suppose two slices are X and Y. Columns 3, 6, 7 and 8 all contain w, and therefore 
at most 16 points each. Columns 1, 5 and 9 contain a, b, or c, and therefore at most 
16 points. So the total number of points is at most 7 x 16 + 2 x 18 = 148 < 151, a 
contradiction. □ 

This, combined with Lemma [3.6[ gives 

Corollary 3.9. Any three parallel slices of A must have cardinality 52,50,49 (or a 
permutation thereof). 

Note that this argument already gives the bound 05,3 < 151. 

Lemma 3.10. No slice j * * * * of A is of the form X , where X was defined in Lemma 
[O 



Proof. Suppose one slice is X; then by the previous discussion one of the parallel slices 
has 52 points and is thus of the form Ej for some j = 0, 1, 2, by Lemma 3.4 



Suppose that X is the first slice 1 * * * *. We have X = xyz ybw zwc. Label the other 
rows with letters from the alphabet, thus 

(xyz ybw zwc' 
mno pqr stu 
def ghi jkl 

Reslice the array into a left nine, middle nine and right nine. One of these squares 
contains 52 points, and it can only be the left nine. One of its three columns contains 
18 points, and it can only be its left-hand column, xmd. So m = y and d = z. But none 
of the Ej begins with y or z, which is a contradiction. So X is not in the first row. 

So X is in the second or third row. By symmetry, suppose it is in the second row, so 
that A has the following shape: 

(def ghi jkl 
xyz ybw zwc 
mno pqr stu 

Again, the left-hand nine must contain 52 points, so it is E 2 . Now, to get 52 points in 
any row, the first row must be E 2 . Then the only way to have 50 points in the middle 
or right-hand nine is if the middle nine is X: 

(z'xy xyz 
xyz ybw 
yzx' zwc 

In the seventh column, s contains 5 points and in the eighth column, t contains 4 points. 




The final row can now contain at most 48 points, contradicting Corollary 3.9 
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A similar argument is possible if X is in the third row; or if X is replaced by Y or Z. 
Thus, given any decomposition of A into three parallel slices, one slice is a 52-point set 
Ej and another slice is 50 points contained in E\~. □ 

Now we can obtain the desired contradiction: 

Lemma 3.11. There is no 151-point line-free set A C [3] 5 . 



Proof. Assume by symmetry that the first row contains 52 points and the second row 
contains 50. If E\ is in the first row, then the second row must be contained in Eq: 

(xyz yz'x zxy' 
y'zx zx'y xyz 
def ghi jkl 

But then none of the left nine, middle nine or right nine can contain 52 points, which 



contradicts Corollary |3.9| Suppose the first row is Eq. Then the second row is contained 
in E2, otherwise the cubes formed from the nine columns of the diagram would need to 
remove too many points: 



y'zx zx'y xyz 



A = I z'xy xyz yzx' 
\def ghi jkl 

But then neither the left nine, middle nine nor right nine contain 52 points. So the first 
row contains E2, and the second row is contained in E\. Two points may be removed 
from the second row of this diagram: 

(z'xy xyz yzx' 
xyz yz'x zxy' 
def ghi jkl 

Slice it into the left nine, middle nine and right nine. Two of them are contained in 
Ej so at least two of def, ghi, and jkl are contained in the corresponding slice of 
Eq. Slice along a different axis, and at least two of dgj, ehk, fil are contained in the 
corresponding slice of Eq. So eight of the nine squares in the bottom row are contained 
in the corresponding square of Eq. Indeed, slice along other axes, and all points except 
one are contained within Eq. This point is the intersection of all the 49-point slices. 
So, if there is a 151-point solution, then after removal of the specified point, there is a 
150-point solution, within D 5 j, whose slices in each direction are 52 + 50 + 48. 

(z'xy xyz yzx'\ 
xyz yz'x zxy' 
y'zx zx'y xyz I 

One point must be lost from columns 3, 6, 7 and 8, and four more from the major 
diagonal z'z'z. That leaves 148 points instead of 150. So the 150-point solution does 
not exist with 52 + 50 + 48 slices; so the 151 point solution does not exist. □ 



This establishes that c 5j3 < 150, and thus c 6j3 < 3c 5i3 < 450. 
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Figure 6. Lower bounds for c' n3 obtained by the Ab construction. 
4. Lower bounds for the Moser problem 



Just as for the density Hales- Jewett problem, we found that Gamma sets Y a ^^ c were 
useful in providing large lower bounds for the Moser problem. This is despite the fact 
that the symmetries of the cube do not respect Gamma sets. 

Observe that if B C A n , then the set A B := {J SeB L a )6 c is a Moser set as long as B does 
not contain any "isosceles triangles" (a + r, b, c + s), (a + s, b, c + r), (a, b + r + s, c) for 
any r, s > not both zero; in particular, B cannot contain any "vertical line segments" 
(a+r, b, c+r), (a, b+2r, c). An example of such a set is provided by selecting < i < n—3 
and letting B consist of the triples (a, n—i, i—a) when a^3 mod 3, (a, n— i— 1, a) 
when a ^ 1 mod 3, (a, n — i — 2, i + 2 — a) when a = mod 3, and (a, n — i — 3, i + 3 — a) 
when a = 2 mod 3. Asymptotically, this set includes about two thirds of the spheres 
S n ,i, SrM+i and one third of the spheres 5' nji+2 , S nji+3 and (setting i close to n/3) gives 
a lower bound 

< 3 > (C - o(l))3VVn (4.1) 
with C7 = 2 x \/^-. This lower bound is the asymptotic limit of our methods; see 



Proposition 4.1 below. 



An integer program was solved to obtain the optimal lower bounds achievable by the 



Ab construction (using (2.1), of course). The results for 1 < n < 20 are displayed in 



Figure [6J More complete data, including the list of optimisers, can be found at [T7] . 

Unfortunately, any method based purely on the Ab construction cannot do asymptoti- 
cally better than the previous constructions: 

Proposition 4.1. Let B C A n be such that Ab is a Moser set. Then \Ab\ < (2y^J^ + 

°(D)5=- 

Proof. By the previous discussion, B cannot contain any pair of the form (a, b + 
2r, c), (a + r,b,c + r) with r > 0. In other words, for any —n < h < n, B can contain 
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HHE Hffla HHH 

aaa ffliffl Bfflg 

HHH BfflK #L"J 

aaa ffliffl Bfflg 

^BHS ffl^lH StS^S 
BfflK HMffl aHB 

HHH BfflK #LM 
BfflK HMffl aHE 

#ll ase san 



Figure 7. One of the examples of 353-point sets in [3] 6 (elements of the 
set being indicated by white squares). This example was generated by a 
genetic algorithm. 



at most one triple (a, b, c) with c — a = h. From this and (2.1), we see that 

n . 



A B \ < 



h=—n 



max 



(a,b,c)<=A n :c-a=h d!6!c! 



From the Chernoff inequality (or the Stirling formula computation below) we see that 



a\b\c\ 



< 



1 on 



3 n unless a,b,c = n/3 + 0{n x l 2 log 1//2 n), so we may restrict to this regime, 
which also forces h = 0(n 1 ^ 2 log 1//2 n). If we write a = n/3 + a, b = n/3 + (3, c = n/3 + 7 
and apply Stirling's formula n\ = (1 + o(l))\j2ixnn n e~ r \ we obtain 



n \ Q3/2 

a!o!c! 2™ 



,n Nn ^ 3« x 
-+a log 1+ — 
3 n 



^+/3)iog(i+^; 

3 n 



:^+7)io g (i+^)). 

3 n 
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From Taylor expansion one has 

,n ,., 3a. 3a 2 

- - + a) log 1 + —=-«- — + o 1 
3 n in 

and similarly for /3, 7; since a + /3 + 7 = 0, we conclude that 

1 + o(l))^3 n exp(- + p 2 + 7 2 }) _ 



a!6!c! 27m 2n 

If c - a = h, then a 2 + (3 2 + 7 2 = ^ + ^. Thus we see that 

72J 3 3 / 2 3 

max — — < (l + o(l))— 3"exp(— h 2 ). 
( a ,b,c)eA n :c-a=h alblcl 2nn An 

Using the integral test, we thus have 

o3/2 r a 

\A B \ < (l + o(l))— 3" / exp(— x 2 ) dx. 

2nn J R An 

Since J R exp(— ^x 2 ) dx = y^p, we obtain the claim. □ 



Actually it is possible to improve upon these bounds by a slight amount. Observe that 



if B is a maximiser for the right-hand side of (2.1 ) (subject to B not containing isosceles 
triangles), then any triple (a, b, c) not in B must be the vertex of a (possibly degenerate) 
isosceles triangle with the other vertices in B. If this triangle is non-degenerate, or if 
(a, b, c) is the upper vertex of a degenerate isosceles triangle, then no point from T a ^ c can 
be added to Ab without creating a geometric line. However, if (a, b, c) = (a' + r, b', c' + r) 
is only the lower vertex of a degenerate isosceles triangle (a 1 + r,b' ,c' + r) , (a', b' + 2r, c') , 
then one can add any subset of T a ^ c to Ab and still have a Moser set as long as no pair 
of elements in that subset is separated by Hamming distance 2r. For instance, in the 
n = 5 case, we can start with the 122-point set built from 

B = {(005), (023), (113), (122), (221), (311), (320), (500))} 

and add a point each from (104) and (401). This gives an example of the maximal, 
124-point solution. Again, in the n = 10 case, the set 

B = {(0010), (028), (037), (046), (145), (217), (235), (325), (334), (343), (442), (514), 
(532), (622), (631), (640), (811), (901), (910)} 



generates the lower bound c' 10)3 > 24786 given above (and, up to reflection a •<->■ c, is the 
only such set that does so); but by adding the following twelve elements from r 5! o,5 one 
can increase the lower bound slightly to 24798: 1111133333, 1111313333, 1113113333, 
1133331113, 1133331131, 1133331311, 3311333111, 3313133111, 3313313111, 3331111133, 
3331111313, 3331111331 

A more general form goes with the B set described at the start of this section. Include 
points from r( ajn _j_ 4j j +4 _ a ) when a = l(mod 3), subject to no two points being included 
if they differ by the interchange of a 1 and a 3. Each of these Gamma sets is the feet of 
a degenerate isosceles triangle with vertex r( a _ l n _j_ 2ia +3-a)- 
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Lemma 4.2. If A is a subset of r7^& c \ such that no two points of A differ by the 
interchange of a 1 and a 3, then \A\ < |r aj6)C |/(l + max(a, c)). 

Proof. Say that two points of T a ^ tC are neighbours if they differ by the exchange of a 
1 and a 3. Each point of A has ac neighbours, none of which are in A. Each point of 
r( a ,b,c)\^4 has ac neighbours, but only min(a, c) of them may be in A. So for each point 
of A there are on average ac/ min(a, c) = max(a, c) points not in A. So the proportion 
of points of r( a) 6 iC ) that are in A is at most one in 1 + max(a, c). □ 



The proportion of extra points for each of the cells r( a ri _j_ 4 j + 4_ a ) is no more than 
2/(i + 6). Only one cell in three is included from the b = n — i — 4 layer, so we expect 
no more than ( " 4 )2* +5 /(3i + 18) new points, all from S n ^ + 4. One can also find extra 
points from S nti+5 and higher spheres. 

Earlier solutions may also give insight into the problem. Clearly we have c' 3 = 1 and 
c\ 3 = 2, so we focus on the case n > 2. The first lower bounds may be due to Komlos 
[16J, who observed that the sphere Si yn of elements with exactly n — i 2 entries (see 
Section [Li] for definition), is a Moser set, so that 

4,3 > \Si,n\ (4-2) 

holds for all i. Choosing i = [^J and applying Stirling's formula, we see that this lower 

j-. In particular c 33 



bound takes the form (|4.1[) with C := \l j~- In particular c 33 > 12,c' 43 > 24, c' 5 3 > 



80, 4 3 > 240. 

These values can be improved by studying combinations of several spheres or semi- 
spheres or applying elementary results from coding theory. 

Observe that if {w(l), u>(2), w{3)} is a geometric line in [3] n , then w(l),w(3) both lie in 
the same sphere Si, n , and that w(2) lies in a lower sphere Si- T , n for some 1 < r < i < n. 
Furthermore, w(l) and w(3) are separated by Hamming distance r. 

As a consequence, we see that S^-i,™ U Sf n (or Si_i in U S° n ) is a Moser set for any 
1 < i < n, since any two distinct elements Sf n are separated by a Hamming distance 
of at least two. (Recall Section [ITT] for definitions), This leads to the lower bound 

It is not hard to see that ("i 1 )2 i > ( n ^ 1 )2 4_1 if and only if 3i < 2n + 1, and so this 



lower bound is maximised when i = [^y^J f° r n > 2, giving the formula (1.2). This 
leads to the lower bounds 

4,3 > 6 ; 4,3 > 16 ; 4,3 > 4 °; 4,3 > 12 °; 4,3 > 336 

which gives the right lower bounds for n = 2, 3, but is slightly off for n = 4,5. Asymp- 

3 Y /i 
2 V 4tt ' 



totically, Stirling's formula and (4.3) then give the lower bound (4.1) with C — | x ' " 



which is asymptotically 50% better than the bound (4.2) 
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The work of Chvatal [7] already contained a refinement of this idea which we here 
translate into the usual notation of coding theory: Let A(n, d) denote the size of the 
largest binary code of length n and minimal distance d. 



Then 

c: 



n,3 ^ m f x (j2 (j) A ^ n ~^k-J + 1) j • (4.4) 



The following values of A(n, d) for small n, d are known, see [5]: 



A(l,l) = 2 

A(2,l) = 4 A(2,2) = 2 

A(3, 1) = 8 A(3,2)=4 A(3, 3) = 2 

.4(4,1) = 16 A(4,2) = 8 A(4, 3) = 2 A(4, 4) = 2 

A(5,l) = 32 A(5,2) = 16 A(5,3) = 4 A(5,4) = 2 A(5,5) = 2 

A(6,l) = 64 A(6,2) = 32 A(6, 3) = 8 A(6, 4) = 4 A(6,5) = 2 A(6,6) = 2 

A(7, 1) = 128 A(7,2) = 64 A(7,3) = 16 A(7,4) = 8 A(7, 5) = 2 A(7, 6) = 2 A(7, 7) = 2 

A(8, 1) = 256 A(8,2) = 128 A(8, 3) = 20 A(8, 4) = 16 A(8, 5) = 4 A(8, 6) = 2 A(8, 7) = 2 A(8,8) = 

A(9,l) = 512 A(9,2) = 256 A(9, 3) = 40 A(9, 4) = 20 A(9,5) = 6 A(9, 6) = 4 A(9, 7) = 2 A(9,8) = 

A(10,l) = 1024 A(10, 2) = 512 A(10, 3) = 72 A(10,4) = 40 A(10, 5) = 12 A(10,6) = 6 A(10, 7) = 2 A(10,8) 

A(ll,l) = 2048 A(ll,2) = 1024 A(ll, 3) = 144 A(ll,4) = 72 A(ll,5) = 24 A(ll,6) = 12 A(ll,7) = 2 A(ll,8) 

A(12,l) = 4096 A(12,2) = 2048 A(12,3) = 256 A(12,4) = 144 A(12,5) = 32 A(12,6) = 24 A(12, 7) = 4 A(12,8) 

A(13, 1) = 8192 A(13, 2) = 4096 A(13, 3) = 512 A(13,4) = 256 A(13, 5) = 64 A(12,6) = 32 A(13, 7) = 8 A(13,8) 

In addition, one has the general identities A(n, 1) = 2 n , A(n, 2) = 2 n_1 , A(n— 1, 2e— 1) = 



A(n, 2e), and A(n, d) = 2, if d > 



2n 
3 



Inserting this data into ( 4.4[ ) for k = 2 we obtain the lower bounds 



"4,3 



> QA(4,3) + QA(3,2) + QA(2,1) = 1-2 + 4-4 + 6-4 =42 
c' 5i3 > 6 A(5, 3) + (f)A(4, 2) + QA(3,1) = 1-4 + 5-8 + 10-8 =124 
c' 6)3 > QA(6 ) 3) + Q^(5,2) + ©A(4,l) = l-8 + 6-16 + 15-16 = 344. 

Similarly, with k = 3 we obtain 

c' 73 > QA(7,4) + QA(6,3) + QA(5,2) + QA(4,1) = 960. 
c' 8i3 > ()A(8,4) + («)A(7,3) + g)A(6,2) + ()A(5,l) = 2832 
c' 9i3 > QA(9,4) + QA(8,3) + QA(7,2) + QA(6,1) = 7880 

and for k = 4 we have 

c'io,3 > roA(10,5) + (\°)A(9,4) + ( 1 2 )A(8,3) + (^)A(7,2) + ( 1 4 °)A(6,l) = 22232 
cii l3 > (;V(11,5) + (\V( 1 0,4) + ( 1 2 V(9 ) 3) + ( 1 3 1 )A(8 ) 2) + ( 1 4V(7,1) = 66024 
c' 12)3 > (;V( 12 ,5) + (\ 2 )A(11,4) + ( 1 2 V(10,3) + ( 1 3 2 )A(9,2) + ( 1 4 2 )A( 8 ' 1 ) = 18868 ^ 
and for A; = 5 we have 

c' 13 3 > 539168. 

It should be pointed out that these bounds are even numbers, so that c' 4 3 = 43 shows 
that one cannot generally expect this lower bound to give the optimum. 

The maximum value appears to occur for k = L^pJ , but even after optimising in these 
parameters and using explicit bounds on A(n, d) we were unable to improve upon the 

constant C = 2 x J j- for (4.1) arising from previously discussed constructions. Using 
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the singleton bound A(n, d) < 2 n d+1 Chvatal [7] proved that the expression on the 
ght hand side of (4.4) is also O so th & t the refinement described above gains a 



constant factor over the initial construction only. 

For n = 4 the above does not yet give the exact value. The value c' 43 = 43 was first 
proven by Chandra [6]. A uniform way of describing examples for the optimum values 
of c' 4 3 = 43 and c' 5 3 = 124 is as follows. 

Let us consider the sets 

A := U Sl n U A' 

where A' C S i+ i >n has the property that any two elements in A' are separated by a 
Hamming distance of at least three, or have a Hamming distance of exactly one but 
their midpoint lies in S° n . By the previous discussion we see that this is a Moser set, 
and we have the lower bound 



c 



' >( U + 1 )2 i - 1 + \A'\. (4.5) 



71,3 



This gives some improved lower bounds for c' n3 : 

• By taking n = 4, i = 3, and A' = {1111,3331,3333}, we obtain c' A3 > 43; 

• By taking n = 5, i = 4, and A' = {11111,11333,33311,33331}, we obtain 
4,3 > 124. 

• Bytakingn = 6, i = 5,andA' = {111111,111113,111331,111333,331111,331113}, 
we obtain c' 63 > 342. 



This gives the lower bounds in Theorem 1.5 up to n = 5, but the bound for n = 6 is 
inferior to the lower bound c' e 3 > 344 given above. 



4.1. Higher k values. We now consider lower bounds for c' nk for some values of k 
larger than 3. Here we will see some further connections between the Moser problem 
and the density Hales- Jewett problem. 

For k = 4, we have the lower bounds c' n i > ( n / 2 )2 Tl - To see this, observe that the set 
of points with a Is, b 2s, c 3s and d 4s, where a + d has the constant value n/2, does not 
form geometric lines because points at the ends of a geometric line have more a or d 
values than points in the middle of the line. 

The following lower bound is asymptotically twice as large. Take all points with a Is, 
b 2s, c 3s and d 4s, for which: 

• Either a + d = q or q — 1, a and b have the same parity; or 

• a + d = q — 2 or q — 3, a and b have opposite parity. 

This includes half the points of four adjacent layers, and therefore may include (1 + 
o(l))(")2" +1 points. 
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We also have a lower bound for c' n 5 similar to Theorem 



1~3( namely c' nj5 = 5*»-0(v^i). 
Consider points with a Is, 6 2s, c 3s, d 4s and e 5s. For each point, take the value 
a + e + 2(6 + d) + 3c. The first three points in any geometric line give values that form 
an arithmetic progression of length three. 



Select a set of integers with no arithmetic progression of length 3. Select all points 
whose value belongs to that sequence; there will be no geometric line among those 
points. By the Behrend construction jl], it is possible to choose these points with density 
exp — O(vTogn). 

For k = 6, we observe that the asymptotic c' n6 = o(6 n ) would imply the k = 3 density 
Hales- Jewett theorem c n ^ = o(3 n ). Indeed, any k = 3 combinatorial line-free set can be 
"doubled up" into a k = 6 geometric line-free set of the same density by pulling back 
the set from the map that maps 1, 2, 3, 4, 5, 6 to 1, 2, 3, 3, 2, 1 respectively; note that this 
map sends k = 6 geometric lines to k — 3 combinatorial lines. So c' n6 > 2 n c„ i3 , and 
more generally, c' n2k > 1 n c n ^. 



5. Upper bounds for the k = 3 Moser problem in small dimensions 



In this section we finish the proof of Theorem 1.5 by obtaining the upper bounds on 



c' n 3 for n < 6. 

5.1. Statistics, densities and slices. Our analysis will revolve around various statis- 
tics of Moser sets Ad [3) n , their associated densities, and the behavior of such statistics 
and densities with respect to the operation of passing from the cube [3] n to various slices 
of that cube. 

Definition 5.1 (Statistics and densities). Let A C [3] n be a set. For any < i < n, 

set ai(A) := \Af) S n ^ n \] thus we have 



0<a i {A)<\S n - i , n \= [ n )2 



n—t 



for < i < n and 

a (A) + . . . + a n (A) = \A\. 

We refer to the vector (a (A), . . . , a n (A)) as the statistics of A. We define the i th density 
ai(A) to be the quantity 

ai(A) 
< A ) ■= Jny^—, 

thus < a.i(A) < 1 and 



i=0 ^ ' 



Example 5.2. Let n = 2 and A be the Moser set A := {12, 13, 21, 23, 31, 32}. Then the 
statistics (a (A), ai(A), a 2 {A)) of A are (2, 4, 0), and the densities (a (A), ai(A), a 2 (A)) 
are (|, 1, 0). 
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When working with small values of n, it will be convenient to write a(A), b(A), c(A), 
etc. for a (A), ai(A), a 2 (A), etc., and similarly write a (A), /3(A), "/(A), etc. for a (A), 



a\(A), 02(A), etc. Thus for instance in Example 5.2 we have b(A) = 4 and a(A) = |. 



Definition 5.3 (Subspace statistics and densities). If V is a fc-dimensional geometric 
subspace of [3] n , then we have a map (fry : [3] fe — > [3] n from the fc-dimensional cube 
to the n-dimensional cube. If A C [3]™ is a set and < i < k, we write a^V, A) for 
a i ((j) v 1 (A)) and ati(V, A) for a i (4> v 1 (A)). If the set A is clear from context, we abbreviate 
ai(V,A) as a,i(V) and ai(V,A) as atiiV). 



Recall from Section that the cube [3] n can be subdivided into three slices in n 
different ways, and each slice is an n — 1-dimensional subspace. For instance, [3] 3 can 
be partitioned into 1**, 2* *, 3**. We call a slice a centre slice if the fixed coordinate 
is 2 and a side slice if it is 1 or 3. 



Example 5.4. We continue Example 5.2 Then the statistics of the side slice 1* are 



(ct(l*), b(l*)) = (1,1), while the statistics of the centre slice 2* are (a(2*), 6(2*)) 
(2,0). The corresponding densities are (a(l*), /3(1*)) = (1/2,1) and (a(2*), /3(2*)) 
(1,0). 



A simple double counting argument gives the following useful identity: 

Lemma 5.5 (Double counting identity). Let A C [3] n and < i < n — 1. Then we 
have 

£ a m (F) = ^ I £ a,(iy) = a m (A) 

v a side slice w a centre slice 

where V ranges over the 2n side slices of [3] n , and W ranges over the n centre slices. 
In other words, the average value of ai+i(V) for side slices V equals the average value 
of ai(W) for centre slices W, which is in turn equal to ati + i(A). 



Indeed, this lemma follows from the observation that every string in AflSVi-j-i.n belongs 
to i + 1 centre slices W (and contributes to a>i(W)) and to n — i — 1 side slices V (and 
contributes to Oi+i(V)). One can also view this lemma probabilistically, as the assertion 
that there are three equivalent ways to generate a random string of length n: 



Pick a side slice V at random, and randomly fill in the wildcards in such a way 
that i + 1 of the wildcards are 2's (i.e. using an element of S n -i-2,n-i)- 
Pick a centre slice V at random, and randomly fill in the wildcards in such a 
way that i of the wildcards are 2's (i.e. using an element of S n -i-i jn -i)- 
Randomly choose an element of S n -i-i n . 



Example 5.6. We continue Example 5.2 The average value of (3 for side slices is equal 



to the average value of a for centre slices, which is equal to /3(A) = 1. 



Another very useful fact (essentially due to [8]) is that linear inequalities for statistics 
of Moser sets at one dimension propagate to linear inequalities in higher dimensions: 
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Lemma 5.7 (Propagation lemma). Let n > 1 an integer. Suppose one has a linear 
inequality of the form 

n 
8=0 

for all Moser sets A C [3] ra and some real numbers Vq, . . . , v n , s. Then we also have the 
linear inequality 



^Via qi+r {A) < s 



i=0 



whenever q > 1, r > 0, N > nq + r are integers and A C [3] is a Moser set. 



Proof. We run a probabilistic argument (one could of course also use a double counting 
argument instead). Let n,v , . . . , v n , s, q, r, N, A be as in the lemma. Let V be a random 
n-dimensional geometric subspace of [3]^, created in the following fashion: 



• Pick n wildcards Xx, . . . ,x n to run independently from 1 to 3. We also introduce 
dual wildcards x~x, . . . , x~^\ each x] will take the value 4 — Xj. 

• We randomly subdivide the N coordinates into n groups of q coordinates, plus 
a remaining group of N — nq "fixed" coordinates. 

• For each coordinate in the j th group of q coordinates for 1 < j < n, we randomly 
assign either a Xj or x]. 

• For each coordinate in the N — nq fixed coordinates, we randomly assign a digit 
1, 2, 3, but condition on the event that exactly r of the digits are equal to 2 (i.e. 
we use a random element of Sjv-ng-r,Ar-n g )- 

• Let V be the subspace created by allowing xx, ■ ■ ■ , x n to run independently from 
1 to 3, and x] to take the value 4 — Xj. 

For instance, if n = 2, q = 2, r = 1, N = 6, then a typical subspace V generated in this 
fashion is 

2x 1 x^3x 2 x 1 = {213311, 212321, 211331, 223312, 222322, 221332, 233313, 232323, 231333}. 

Observe from that the following two ways to generate a random element of [3]^ are 
equivalent: 



• Pick V randomly as above, and then assign (xi, . . . ,x n ) randomly from S, 
Assign 4 — Xj to x] for all 1 < j < n. 

• Pick a random string in SN- q i- r ,N- 



Indeed, both random variables are invariant under the symmetries of the cube, and 
both random variables always pick out strings in SN- q i-r,N, arid the claim follows. As a 
consequence, we see that the expectation of aii(V) (as V ranges over the recipe described 
above) is equal to a qi+r (A). On the other hand, from (5.1) we have 

n 
i=0 

for all such V; taking expectations over V, we obtain the claim. □ 
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In view of Lemma |5.7[ it is of interest to locate linear inequalities relating the densities 
Qi(A), or (equivalently) the statistics a^A). For this, it is convenient to introduce the 
following notation. 

Definition 5.8. Let n > 1 be an integer. 



• A vector (a , . . . , a n ) of non-negative integers is feasible if it is the statistics of 
some Moser set A. 

• A feasible vector (a , . . . , a n ) is Pareto- optimal if there is no other feasible vector 
(60, ■ ■ ■ ,b n ) 7^ (a , . . . , a n ) such that hi > for all < i < n. 

• A Pareto-optimal vector (do, . . . ,a n ) is extremal if it is not a non-trivial convex 
linear combination of other Pareto-optimal vectors. 



To establish a linear inequality of the form (5.1) with the Vi non-negative, it suffices to 
test the inequality against densities associated to extremal vectors of statistics. (There 
is no point considering linear inequalities with negative coefficients V{, since one always 
has the freedom to reduce a density oti(A) of a Moser set A to zero, simply by removing 
all elements of A with exactly i 2's.) 



We will classify exactly the Pareto-optimal and extremal vectors for n < 3, which by 
Lemma |5.7 will lead to useful linear inequalities for n > 4. Using a computer, we have 
also located a partial list of Pareto-optimal and extremal vectors for n = 4, which are 
also useful for the n = 5 and n = 6 theory. 



5.2. Up to three dimensions. We now establish Theorem 1.5 for n < 3, and establish 



some auxiliary inequalities which will be of use in higher dimensions. 

The case n = is trivial. When n — 1, it is clear that c' x 3 = 2, and furthermore that 
the Pareto-optimal statistics are (2,0) and (1,1), which are both extremal. This leads 
to the linear inequality 

2a(A) + /3(A) < 2 



for all Moser sets Ac [3] 1 , which by Lemma 5.7 implies that 

2a r {A) + a r+q {A) < 2 (5.2) 
whenever r > 0, q > 1, n > q + r, and A C [3] n is a Moser set. 

For n = 2, we see by partitioning [3] 2 into three slices that c' 2 3 < 3c' 13 = 6, and so (by the 
lower bounds in the previous section) d 2 $ = 6. Writing (a, b, c) = (a(A),b(A),c(A)) = 



(4a(A),4/3(A),~/(A)), the inequalities (5.2) become 



a + 2c < 4; 6 + 2c < A;2a + b <= 8. (5.3) 

Lemma 5.9. When n = 2, the Pareto-optimal statistics are (4, 0, 0), (3, 2, 0), (2, 4, 0), (2, 2, 1). 
In particular, the extremal statistics are (4, 0, 0), (2, 4, 0), (2, 2, 1). 



Proof. One easily checks that all the statistics listed above are feasible. Consider the 
statistics (a, b, c) of a Moser set A C [3] 2 . c is either equal to or 1. If c = 1, 
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then (5.3) implies that a, b < 2, so the only Pareto-optimal statistic here is (2,2,1). 
When instead c = 0, the inequalities (5.3) can easily imply the Pareto-optimality of 
(4,0,0), (3,2,0), (2,4,0). □ 



From this lemma we see that we obtain a new inequality 2a + b + 2c < 8. Converting 



this back to densities and using Lemma 5.7, we conclude that 



Aa T {A) + 2a r+q (A) + a r+2q < 4 (5.4) 
whenever r > 0, g > 1, n > q + 2r, and A C [3] n is a Moser set. 

The line-free subsets of [3] 2 can be easily exhausted by computer search; it turns out 
that there are 230 such sets. 

Now we look at three dimensions. Writing (a, 6, c, d) for the statistics of a Moser set 



A C [3] n (which thus range between (0,0,0,0) and (8,12,6,1)), the inequalities (5.2) 
imply in particular that 

a + 4d < 8; b + 6d < 12; c + 3d < 6; 3a + 2c < 24; b + c < 12 (5.5) 



while (5.4) implies that 

3a + 6 + c< 24; o + c + 3d < 12. (5.6) 
Summing the inequalities b + c < 12, 3a + b + c < 24, b + c + 3d < 12 yields 

3(a + b + c + d) < 48 

and hence \A\ = a+b+c+d < 16; comparing this with the lower bounds of the preceding 
section we obtain c' 3 3 = 16 as required. (This argument is essentially identical to the 



one m 



We have the following useful computation: 

Lemma 5.10 (3D Pareto-optimals). When n = 3, the Pareto-optimal statistics are 
(3, 6, 3, 1), (4, 4, 3, 1), (4, 6, 2, 1), (2, 6, 6, 0), (3, 6, 5, 0), (4, 4, 5, 0), (3, 7, 4, 0), (4, 6, 4, 0), 
(3, 9, 3, 0), (4, 7, 3, 0), (5, 4, 3, 0), (4, 9, 2, 0), (5, 6, 2, 0), (6, 3, 2, 0), (3, 10, 1, 0), (5, 7, 1, 0), 
(6, 4, 1, 0), (4, 12, 0, 0), (5, 9, 0, 0), (6, 6, 0, 0), (7, 3, 0, 0), (8, 0, 0, 0). 

In particular, the extremal statistics are 

(3, 6, 3, 1), (4, 4, 3, 1), (4, 6, 2, 1), (2, 6, 6, 0), (4, 4, 5, 0), (4, 6, 4, 0), (4, 12, 0, 0), (8, 0, 0, 0). 



Proof. This can be established by a brute-force search over the 2 27 1.3 x 10 8 different 
subsets of [3] 3 . Actually, one can perform a much faster search than this. Firstly, 
as noted earlier, there are only 230 line-free subsets of [3] 2 , so one could search over 
230 3 ~ 1.2 x 10 7 configurations instead. Secondly, by symmetry we may assume (after 
enumerating the 230 sets in a suitable fashion) that the first slice A D 1 * * has an index 
less than or equal to the third A D 3 * *, leading to ( 231 ) x 230 ~ 6 x 10 6 configurations 
instead. Finally, using the first and third slice one can quickly determine which elements 
of the second slice 2 * * are prohibited from A. There are 2 9 = 512 possible choices for 
the prohibited set in 2 * *. By crosschecking these against the list of 230 line-free sets 
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one can compute the Pareto-optimal statistics for the second slices inside the prohibited 
set (the lists of such statistics turns out to length at most 23). Storing these statistics 
in a lookup table, and then running over all choices of the first and third slice (using 
symmetry) , one now has to perform 0(512 x230)+O(( 231 )x 23) « O(10 6 ) computations, 
which is quite a feasible computation. 

One could in principle reduce the computations even further, by a factor of up to 8, 
by using the symmetry group D4 of the square [3] 2 to reduce the number of cases one 
needs to consider, but we did not implement this. 

A computer-free proof of this lemma can be found at the page Human proof of the 
3D Pareto-optimal Moser statistics at [23]. □ 

Remark 5.11. A similar computation revealed that the total number of line-free subsets 
of [3] 3 was 3813884. With respect to the 2 3 x 3! = 48-element group of geometric 
symmetries of [3] 3 , these sets partitioned into 83158 equivalence classes: 

3813884 = 76066x48+6527x24+51x16+338x12+109x8+41x6+13x4+5x3+3x2+5x1. 



Lemma 5.10 yields the following new inequalities: 

2a + b + 2c + 4d < 22 
3a + 2b + 3c + 6d < 36 
7a + 2b + 4c + 8d < 56 
6a + 2b + 3c + 6d < 48 
a + 2c + Ad < 14 
5a + 4c + 8d < 40. 



Applying Lemma |5.7[ we obtain new inequalities: 

8a r (A) + 6a r+q (A) + 6a r+2q (A) + 2a r+3q {A) < 11 (5.7) 

Aa r {A) + Aa r+q {A) + {A) + a r+3q {A) < 6 (5.8) 

7a r {A) + 3a r+q {A) + 30<r+2q{A) + a r+ 3 q (A) < 7 

8o> r [A) + 3<y r ^ q {A) + 3tt f ._|_2 <? (A) + a r+3q (A) < 8 (5.9) 
Aa r+q (A) + 2a r+2q (A) + a r+3q (A) < A 
Aa r {A) + 6a r+2q {A) + 2a r+3q {A) < 7 
5a r (A) + 3a r+2q (A) + a r+3q (A) < 5 

whenever r>0, g>l, n > r + 3g, and Moser sets A C [3] n . 
We also note some further corollaries of Lemma 15.101 

Corollary 5.12 (Statistics of large 3D Moser sets). Let (a,b,c,d) be the statistics of a 
Moser set A in [3] 3 . Then \A\ = a + b + c + d < 16. Furthermore: 

• If \A\ — 16 ; then (a, 6, c, d) = (4, 12, 0, 0) . 



DENSITY HALES-JEWETT AND MOSER NUMBERS 



31 



• If \A\ — 15 ; then (a, b, c, d) = (4, 11, 0, 0) or (3, 12, 0, 0) . 

• If \A\ > 14, then b>6andd = 0. 

• If \A\ = 13 and d = 1, then (a, b, c, d) = (4, 6, 2, 1) or (3, 6, 3, 1). 



5.3. Four dimensions. Now we establish the bound c' 43 = 43. Let A be a Moser 
set in [3] 4 , with attendant statistics (a,b,c,d,e), which range between (0,0,0,0,0) and 
(16,32,24,8,1). In view of the lower bounds, our task here is to establish the upper 
bound a+b+c+d+e< 43. 

The linear inequalities already established just barely fail to achieve this bound, but we 
can obtain the upper bound a+b+c+d+e< 44 as follows. First suppose that e = 1; 



then from the inequalities (5.2) (or by considering lines passing through 2222) we see 
that a < 8, b < 16, c < 12, d < 4 and hence a + b + c + d + e < 41, so we may assume 
that e = 0. 



From Lemma 5.5, we see that a+b+c+d+e is now equal to the sum of a(V) / 'A+b(V) /3 + 
c(V)/2 + d(V), where V ranges over all side slices of [3] 4 . But from Lemma 5.10 we see 
that a(V)/4 + b(V)/3 + c(V)/2 + d(V) is at most ^, with equality occurring only when 
(a(V),b(V),c(V),d(V)) = (2,6,6,0). This gives the upper bound a + b + c + d + e < 44. 

The above argument shows that a + b + c + d + e = 44 can only occur if e = and if 
(a(V),b(V),c(V),d(V)) = (2,6,6,0) for all side slices V. Applying Lemma 5.10 again 
this implies (a,b,c,d,e) = (4,16,24,0,0). But then A contains all of the sphere 5*2,4, 
which implies that the four-element set A PI cannot contain a pair of strings which 
differ in exactly two positions (as their midpoint would then lie in S^, contradicting 
the hypothesis that A is a Moser set). 

Recall that we may partition £4,4 = U S% 4 , where 

Sf 4 := {1111, 1133, 1313, 3113, 1331, 3131, 3311, 3333} 

is the strings in with an even number of l's, and 

Sl A := {1113, 1131, 1311, 3111, 1333, 3133, 3313, 3331} 

are the strings in ^4,4 with an odd number. Observe that any two distinct elements in 
S4 4 differ in exactly two positions unless they are antipodal. Thus A n SI 4 has size at 
most two, with equality only when A n Sf 4 consists of an antipodal pair. Similarly for 
A n S44. Thus A must consist of two antipodal pairs, one from Sl 4 and one from S% A . 

By the symmetries of the cube we may assume without loss of generality that these 
pairs are {1111,3333} and {1113,3331} respectively. But as A is a Moser set, A must 
now exclude the strings 1112 and 3332. These two strings form two corners of the 
eight-element set 

* * *2 n 5 3 , 4 = {1112, 1132, 1312, 3112, 1332, 3132, 3312, 3332}. 

Any pair of points in this set which are "adjacent" in the sense that they differ by exactly 
one entry cannot both lie in A, as their midpoint would then lie in S^, and so A can 
contain at most four elements from this set, with equality only if A contains all the 
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points in * * *2 fl 53,4 of the same parity (either all the elements with an even number of 
3s, or all the elements with an odd number of 3s). But because the two corners removed 
from this set have the opposite parity (one has an even number of Is and one has an odd 
number), we see in fact that A can contain at most 3 points from this set. Meanwhile, 
the same arguments give that A contains at most four points from **2*n5 , 3i4 , *2**n5 , 3i4 , 
and 2***nS , 3 i 4. Summing we see that b = {AnS^^l < 3 + 4+4 + 4 = 15, a contradiction. 
Thus we have c' 4 3 = 43 as claimed. 



We have the following four-dimensional version of Lemma |5.10 

Lemma 5.13 (4D Pareto-optimals). When n = A, the Pareto- optimal statistics are 
given by the table in Figure [#l 



Proof. This was computed by computer search as follows. First, one observed that if 
(a, b, c, d, e) was Pareto-optimal, then a > 3. To see this, it suffices to show that for any 
Moser set A C [3] 4 with a(A) = 0, it is possible to add three points from 64,4 to A and 
still have a Moser set. To show this, suppose first that A contains a point from £^4, 
such as 2221. Then A must omit either 2211 or 2231; without loss of generality we may 
assume that it omits 2211. Similarly we may assume it omits 2121 and 1221. Then we 
can add 1131, 1311, 3111 to A, as required. Thus we may assume that A contains no 
points from Si j4 . Now suppose that A omits a point from 5 , 2i 4, such as 2211. Then one 
can add 3333, 3111, 1311 to A, as required. Thus we may assume that A contains all of 
5*2,4, which forces A to omit 2222, as well as at least one point from S^, such as 2111. 
But then 3111, 1111, 3333 can be added to the set, a contradiction. 

Thus we only need to search through sets A C [3] 4 for which \A fl 6^4 1 > 3. A straight- 
forward computer search shows that up to the symmetries of the cube, there are 391 
possible choices for A D 64,4. For each such choice, we looped through all the possible 
values of the slices A fl 1 * ** and A fl 3 * **, i.e. all three-dimensional Moser sets which 
had the indicated intersection with S 3>3 . (For fixed A n 64,4, the number of possibilities 
for A fl 1 * ** ranges from 1 to 87123, and similarly for A fl 3 * **). For each pair of 
slices A fl 1 * ** and A fl 3 * **, we computed the lines connecting these two sets to see 
what subset of 2 * ** was excluded from A; there are 2 27 possible such exclusion sets. 
We precomputed a lookup table that gave the Pareto-optimal statistics for A fl 2 * ** 
for each such choice of exclusion set; using this lookup table for each choice of An 1 * ** 
and A fl 3 * ** and collating the results, we obtained the above list. On a Linux clus- 
ter, the lookup table took 22 minutes to create, and the loop over the An 1 * ** and 
An 3 * ** slices took two hours, spread out over 391 machines (one for each choice of 
A PI S^4). Further details (including source code) can be found at the page 4D Moser 
brute force search of [21]. □ 

As a consequence of this data, we have the following facts about the statistics of large 
Moser sets: 

Proposition 5.14. Let A C [3] 4 be a Moser set with statistics (a,b,c,d,e). 
(i) If \A\ > AO, then e = 0. 
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a (b, c, d, e) 

3 (16,24) 

4 (14, 19, 2), (15, 24), (16, 8, 4, 1), (16, 14, 4), (16, 23), (17, 21), (18, 19) 

5 (12,12,4,1), (12,13,6), (12,15,5), (12,19,2), (13,10,4,1), (13,14,5), (13,21,1), (15,9,4,1), (15,12,3,1), (15,13,5), (15,18,3), 

(15.20.1) , (15,22), (16,7,4,1), (16,10,3,1), (16,11,5), (16,12,2,1), (16,16,3), (16,19,1), (16,21), (17,12,4), (17,14,3), 

(17.16.2) , (17,18,1), (17,20), (18,13,3), (18,14,2), (20,8,4), (20,10,3), (20,13,2), (20,14,1), (20,18), (21,10,2), 

(21. 15) , (22, 13) 

6 (8,12,8), (10,11,4,1), (11,12,7), (12,10,7), (12,13,5), (12,18,4), (13,16,4), (14,9,4,1), (14,9,7), (14,12,6), (14,16,3), 
(14,19,1), (14,21) (15,7,4,1), (15,10,3,1), (15,10,6), (15,11,2,1), (15,12,5), (15,15,4), (15,20), (16,7,3,1), (16,8,6), 
(16,9,2,1), (16,10,5), (16,12,1,1), (16,13,4), (16,14,3), (16,18,2), (16,19), (17,9,5), (17,10,4), (17,13,3), (17,15,2), (17,17,1), 
(17,18), (18,13,2), (18,16,1), (18,17), (19,9,4), (19,12,3), (19,15,1), (20,7,4), (20,9,3), (20,12,2), (20,13,1), (20,15), (21,8,3), 

(21.9.2) , (21,12,1), (21,14), (22,7,3), (22,8,2), (22,10,1), (23,9,1), (24,7,2), (24,8,1), (24,12), (25,9), (26,7) 

7 (8,6,8), (11,9,4,1), (11,12,6), (12,8,4,1), (12,8,6), (12,12,3,1), (12,12,5), (12,13,4), (12,15,3), (12,17,2), (13,7,4,1), 
(13,10,3,1), (13,11,5), (13,12,2,1), (13,12,4), (13,14,3), (13,16,2), (14,6,4,1), (14,6,7), (14,9,5), (14,10,2,1), (14,12,1,1), 
(14,17,1), (14,19), (15,7,5), (15,8,3,1), (15,9,2,1), (15,11,1,1), (15,11,4), (15,13,3), (15,16,1), (16,6,3,1), (16,6,6), 
(16,8,2,1), (16,10,1,1), (16,10,4), (16,12,0,1), (16,12,3) (16,15,2), (16,17), (17,6,5), (17,7,4), (17,11,3), (17,13,2), (17,14,1), 

(17.16) , (18,10,3), (18,13,1), (18,15), (19,9,3), (20,6,4), (20,11,2), (20,12,1), (20,14), (21,8,2), (21,10,1), (21,12), 
(22,9,1), (22,11), (23,6,3), (23,7,1), (23,10), (24,6,2), (24,9), (25,6,1), (25,8), (26,3,1), (28,6), (29,3), (30,1) 

8 (8,0,8), (8,9,7), (8,12,6), (9,9,4,1), (9,10,6), (9,12,3,1), (9,12,5), (9,13,4), (9,15,3), (10,7,4,1), (10,10,3,1), (10,10,5), 
(10,12,2,1), (10,12,4), (10,13,3), (10,15,2), (11,6,4,1), (11,9,6), (11,10,2,1), (11,11,4), (12,7,6), (12,9,3,1), (12,9,5), 
(12,10,4), (12,12,1,1), (12,14,2), (12,16,1), (12,18), (13,7,3,1), (13,7,5), (13,9,2,1), (13,12,0,1), (13,12,3), (14,0,7), 
(14,6,6), (14,7,2,1), (14,8,1,1), (14,9,4) (14,11,0,1), (14,11,3), (14,13,2), (14,15,1), (14,17), (15,6,3,1), (15,6,5), 
(15,7,1,1), (16,0,6), (16,4,3,1), (16,4,5), (16,6,2,1), (16,8,4), (16,9,0,1), (16,10,3), (16,12,2), (16,14,1), (16,16) (17,0,5), 
(17,3,4), (17,8,3), (17,10,2), (17,12,1), (17,14), (18,9,2), (18,11,1), (18,12), (19,6,3), (19,8,2), (20,0,4), (20,4,3), (20,7,2), 
(20,9,1), (20,11), (21,4,2), (21,7,1), (22,3,2), (22,6,1), (22,9), (23,0,3), (23,4,1), (24,0,2), (24,3,1), (24,8), (25,1,1), 
(25,6), (26,0,1), (26,4), (28,3), (32) 

9 (8,10,4), (9,9,4), (9,12,3), (10,8,4), (10,10,3), (10,12,2), (10,13,1), (10,15), (11,11,2), (12,7,4), (12,9,3), (12,12,1), (12,14), 

(13.7.3) , (13,10,2), (14,9,2), (14,11,1), (14,13), (15,6,3), (16,0,4), (16,4,3), (16,8,2), (16,10,1), (16,12), (17,3,3), (17,6,2), 
(17,8,1), (17,10), (18,2,3) (18,4,2), (18,7,1), (18,9), (19,0,3), (19,3,2), (19,6,1), (20,1,2), (20,5,1), 

(20,8), (21,4,1), (21,6), (22,1,1), (22,5), (24,4), (25,2), (28) 

10 (8,6,4), (8,8,3), (9,7,3), (9,10,2), (9,11,1), (9,13), (10,5,4), (10,9,2), (10,12), (11,6,3), (12,4,4), (12,5,3), (12,7,2), (12,10,1), 
(12,11), (13,6,2), (13,8,1), (13,10), (14,3,3), (14,5,2), (14,9), (15,2,3), (15,7,1), (16,4,2), (16,6,1), (16,8), (17,4,1), (17,6), 
(18,2,1), (18,5), (20,4), (21,2), (22,1), (24) 

11 (4,6,4), (6,5,4), (7,6,3), (8,4,4), (8,5,3), (9,6,2), (9,8,1), (9,10), (10,3,3), (10,5,2), (10,9), (11,2,3), (11,7,1), (12,4,2), 
(12,6,1), (12,8), (13,4,1), (13,6), (14,2,1), (14,5), (16,4), (17,2), (18,1), (20) 

12 (4,3,3), (6,2,3), (6,5,2), (6,7,1), (6,9), (8,4,2), (8,6,1), (8,8), (9,4,1), (9,6), (10,2,1), (10,5), (12,4) 
(13,2), (14,1), (16) 

13 (6,5), (8,4), (9,2), (10,1), (12) 

14 (4,3), (5,2), (6,1), (8) 

15 (4) 

16 (0) 



Figure 8. The Pareto-optimal statistics (a,b,c,d,e) of Moser sets in 
[3] 4 . To save space, all statistics with the same value of a have been 
collected in a single row; also, trailing zeroes for (b, c, d, e) have been 
dropped, thus for instance (6, c) is short for (6, c, 0, 0). This table can also 
be found at http : / /spreadsheets . google . com/ ccc?key=rwXB_Rn3QlZf 5yaeMQL-RDw. 

(ii) If\A\ > 43, thend = 0. 

(Hi) If\A\ > 42, then d<2. 

(iv) If \A\ > 41, then d<3. 

(v) If \A\ > 40, then d<6. 

(vi) If \A\ > 43, thenc> 18. 

(vii) If \A\ > 42, thenc> 12. 

(viii) If \A\ > 43, then b > 15. 

Remark 5.15. This proposition was first established by an integer program, see the 
file integer.tex at [21]. A computer-free proof can be found at 

terrytao . files . wordpress . com/2009/06/polymath2 .pdf . 
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5.4. Five dimensions. Now we establish the bound c' 5 3 = 124. In view of the lower 
bounds, it suffices to show that there does not exist a Moser set A C [3] 5 with \A\ = 125. 

We argue by contradiction. Let A be as above, and let (a(A), . . . , f(A)) be the statistics 
of A. 

Lemma 5.16. f(A) = 0. 

Proof. If f(A) is non-zero, then A contains 22222, then each of the = 121 antipodal 
pairs in [3] 5 can have at most one point in A, leading to only 122 points. □ 

Let us slice [3] 5 into three parallel slices, e.g. 1****,2****,3****. The intersection 
of A with each of these slices has size at most 43. In particular, this implies that 

|Anl****| + |,4n3****| = 125- |A n 2 * * * *| > 82. (5.10) 

Thus at least one ofAnl****, An 3* * * * has cardinality at least 41; by Proposition 



5.14[ iv) we conclude that 

min(d(l ****), d(3 ****))< 3. (5.11] 



Furthermore, equality can only hold in (5.11) if A fl 1 * * * *, A fl 3 * * * * both have 
cardinality exactly 41, in which case from Proposition 5.14[ iv) again we must have 



d(l ****)= d{3 ****)= 3. (5-12) 
Of course, we have a similar result for permutations. 

Now we improve the bound |A fl 2 * * * *| < 43: 
Lemma 5.17. \A H 2 * * * *| < 41. 

Proof. Suppose first that \A n 2 * * * *| = 43. Let A' C [3] 4 be the subset of [3] 4 
corresponding toAfl2****, thus A 1 is a Moser set of cardinality 43. By Proposition 
5.14( vi), c(A') > 18. By Lemma [BT5| the sum of the c(V), where V ranges over the 



eight side slices of [3] 4 , is therefore at least 36. By the pigeonhole principle, we may 
thus find two opposing side slices, say 1 * ** and 3 * **, with c(l * **) + c(3 ****)> 9. 
Since c(l * **), c(3 * **) cannot exceed 6, we thus have c(l * **), c(3 * **) > 3, with at 
least one of c(l * **),c(3 * **) being at least 5. Passing back to A, this implies that 
d(*l * **), d(*3 * **) > 3, with at least one of d(*l * **), d(*3 * **) being at least 5. But 
this contradicts (5.11) together with the refinement ( 5.12[ ). 



We have just shown that \A fl 2 * * * *| < 42; we can thus improve (5.10) to 

i^ni****| + |An3****| >83. 



Combining this with Proposition 5.14[ ii)-(v) we see that 



d{l ****) + d{3 ****)< 6 (5.13) 
with equality only if|v4fl2****|=42, and similarly for permutations. 
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c(l * **) + c(3 * **) < 6 
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and similarly for permutations. Applying Lemma 5.5, this implies that c(2 * * * *) = 
c(A') < 12. 

Now suppose for contradiction that \A'\ = \A (1 2 * * * *\ = 42. Then by Proposition 



5.14 vii) we have 



c(2 ****) = 12; 



(5.14) 



applying Lemma 5.5 again, this forces c(l * **) + c(3 * **) = 6 and similarly for permu- 
tations, which then implies that 

d(*l***)+d(*3***) = d(**l**)+d(**3**) = d(***l*)+d(***3*) = d(****l)+d(****3) = 6 

(5.15) 

and hence 



\Ar\*2***\ = \Ar\**2**\ = \An***2*\ = \Ar\****2\=42 



and thus 



c(*2 * **) = c(* * 2 * *) = c(* * *2*) = c(* * * * 2) = 12. 



Combining (5.14), (5.15), (5.16) we conclude that 



d(l ****) + d(3 ****) = 16, 



contradicting (5.13). 



(5.16) 



□ 



With this proposition, the bound (5.10) now improves to 



|y4ni****| + |Ari3****| >84 

and in particular 

|Ani****|,|An3****| > 41. 

from this and Proposition 5.14[ ii)-(iv) we now have 

d{\ ****) + d(3 ****)< 4 
and similarly for permutations. 
Lemma 5.18. e(A) = 0. 



(5.17) 
(5.18) 
(5.19) 



Proof. From (5.17), the intersection of A with any side slice has cardinality at least 41, 



and thus by Proposition 5.14 i) such a side slice has an e-statistic of zero. The claim 
then follows from Lemma 5.5 □ 



We need a technical lemma: 

Lemma 5.19. Let B C S^. Then there exist at least \B\ 
which differ in exactly two positions. 



4 pairs of strings in B 
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Proof. The first non- vacuous case is \B\ — 5. It suffices to establish this case, as the 
higher cases then follow by induction (locating a pair of the desired form, then deleting 
one element of that pair from B). 

Suppose for contradiction that one can find a 5-element set B C £5,5 such that no two 
strings in B differ in exactly two positions. Recall that we may split 6*5,5 = 5f 5 U S^ 5 , 
where 5f 5 are those strings with an even number of l's, and S§ 5 are those strings with 
an odd number of l's. By the pigeonhole principle and symmetry we may assume B 
has at least three elements in S£ 5 . Without loss of generality, we can take one of them 
to be 11111, thus excluding all elements in Sg 5 with exactly two 3s, leaving only the 
elements with exactly four 3s. But any two of them differ in exactly two positions, a 
contradiction. □ 

We can now improve the trivial bound c(A) < 80: 

Corollary 5.20 (Non-maximal c). c(A) < 79. If a{A) > 7, then c(A) < 78. 



Proof. If c(A) = 80, then A contains all of 63,5, which then implies that no two elements 



in A n S^fi can differ in exactly two places. It also implies (from (5.2)) that d(A) must 
vanish, and that b(A) is at most 40. By Lemma 5.19, we also have that a(A) = {AnS^^l 
is at most 4. Thus \A\ <4 + 40 + 80 + + = 124, a contradiction. 



Now suppose that a(A) > 7. Then by Lemma 5.19 there are at least three pairs in 
A PI 5*5,5 that differ in exactly two places. Each such pair eliminates one point from 
A PI 5*3,5; but each point in 53,5 can be eliminated by at most two such pairs, and so we 
have at least two points eliminated from A D 53,5, i.e. c(A) < 78 as required. □ 



Next, we rewrite the quantity 125 = \A\ in terms of side slices. From Lemmas 5.16 
15.181 we have 

a(A) + b(A) + c(A) + d(A) = 125 
and hence by Lemma [5~5 the quantity 



s(V) : = a(V) + \b(V) + \c(V) + \d(V) - 
where V ranges over side slices, has an average value of zero. 



125 



Proposition 5.21 (Large values of s(V)). For all side slices, we have s(V) < 1/2. 
Furthermore, we have s(V) < —1/2 unless the statistics (a(V) , b(V) , c(V) , d(V) , e(V)) 
are of one of the following four cases: 



(Type 1) (a{V),b{V),c{V),d{V),e{V)) = (2,16,24,0,0) (and s(V) = -1/2 and 
\A PI V\ =42); 

(Type 2) (a(V), b(V), c(V), d(V), e(V)) = (4, 16, 23, 0, 0) (and s(V) = -1/6 and 
\A PI V\ =43); 

(Type 3) (a(V),b(V),c(V),d(V),e(V)) = (4,15,24,0,0) (and s(V) = 1/4 and 
\A PI V\ =43); 
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• (Type 4) {a{V),b{V),c{V),d{V),e(V)) = (3,16,24,0,0) (and s(V) = 1/2 and 
\AnV\ =43); 



Proof. Let V be a side slice. From (5.18) we have 

41 < a(V) + b(V) + c(V) + d(V) = \A n V\ < 43. 

First suppose that \A D V\ = 43, then from Proposition 5.14[ ii), (viii), d(V) = and 
b(V) > 15. Also, we have the trivial bound c(V) < 24, together with the inequality 

3b(V) + 2c(V) < 96 



from (5.2). To exploit these facts, we rewrite s(V) as 



«00 



-(24 - c(V)) (96 - 3b(V) - 2c(V)). 



1 

2 2 V ~ ~ v " 12' 
Thus s(V) < 1/2 in this case. If s(V) > -1/2, then 

6(24 - c(V)) + (96 - 36(F) - 2c(V)) < 12, 

which together with the inequalities b(V) < 15, c(V) < 24, 3b(V) + 2c{V) < 96 we 
conclude that (b(V),c(V)) must be one of (16,24), (15,24), (16,23), (15,23). The 
first three possibilities lead to Types 4,3,2 respectively. The fourth type would lead to 
(a(V),b(V),c{V),d{V),e{V)) = (5,15,23,0,0), but this contradicts dg7b. 



Next, suppose \A D V | = 42, so by Proposition 5.14[ iii) we have d(V) < 2. From (5.2) 
we have 

2c(V) + 3d(V) < 48 (5.20) 



while from (5.4) we have 



3b(V) + 2c(V) + 3d(V) < 96 



(5.21) 



and so we can rewrite s(V) as 



S (y) 



i 



(4f 



1 



-2c(V)-3d(V))--(96- 



- 3b{V) - 2c{V) - 3d{V)) + -d{V) . (5.22) 



1 

2 1 ' ~ ' ' v " 12' ' ' v " ' 2' 

This already gives s(V) < 1/2. If d(F) = 0, then s(V) < —1/2, with equality only in 
Type 1. If d(V) = 1, then the set A' C [3] 4 corresponding to A fl V contains a point 
in 53,4, which without loss of generality we can take to be 2221. Considering the three 
lines *221, 2 * 21, 22 * 1, we see that at least three points in 62,4 must be missing from 
A, thus c(V) < 21. This forces 48 - 2c(V) - 3d(V) > 3, and so s(V) < -3/4. Finally, 
if d(V) = 2, then A contains two points in 63,4. If they are antipodal (e.g. 2221 and 
2223), the same argument as above shows that at least six points in 6*2,4 are missing from 
A; if they are not antipodal (e.g. 2221 and 2212) then by considering the lines *221, 
2 * 21, 22 * 1, *212, 2 * 12 we see that five points are missing. Thus we have c(V) < 19, 
which forces 48 — 2c(V) — 3d(V) > 4. This forces s(V) < —1/2, with equality only when 
c(V) = 19 and 3b(V) + 2c(V) + 3d{V) = 96, but this forces b{V) to be the non-integer 
52/3, a contradiction, which concludes the treatment of the \A fl V\ =42 case. 



Finally, suppose \An V\ = 41. Using (5.20), (5.21) as before we have 
s(V) = - 3 --\(A8-2c(V)-3d(V)) 



1 ( 96 _ 3b( y) - 2c(V) - 3d(V)) + id(V ) , (5.23) 
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while from Proposition |5. 14 vi) we have d(V) < 3. This already gives s(V) < 0, and 
S (V) < — 1 when d(V) = 1. In order to have s(V) > —1/2, we must then have 
d(V) = 2 or d(V) = 3. But then the arguments of the preceding paragraph give 
48 - 2c(V) - 3d(V) > 4, and so s(V) < -1 in this case. □ 



Since the s(V) average to zero, by the pigeonhole principle we may find two opposing 
side slices (e.g. 1 * * * * and 3 * * * *), whose total s- value is non-negative. Actually we 
can do a little better: 

Lemma 5.22. There exists two opposing side slices whose total s-value is strictly pos- 
itive. 



Proof. If this is not the case, then we must have s(l ****) + s(3 * * * *) = and 



similarly for permutations. Using Proposition 5.21 we thus see that for every opposing 
pair of side slices, one is Type 1 and one is Type 4. In particular c(V) = 24 for all side 
slices V. But then by Lemma 5.5 we have c(A) = 80, contradicting Lemma 5.20 □ 



Let V,V be the side slices in Lemma |5.22| By Proposition 5.21 , the V,V slices must 
then be either Type 2, Type 3, or Type 4, and they cannot both be Type 2. Since 
a(A) = a(V) + a(V), we conclude 

6 < a(A) < 8. (5.24) 

In a similar spirit, we have 

c(V) + c(V) < 23 + 24. 

On the other hand, by considering the 24 lines connecting c-points of V, V to c-points 
of the centre slice W between V and V, each of which contains at most two points in 
A, we have 

c(V) + c{W) + c{V) < 24 x 2. 

Thus c(W) < 1; since 

d(A) = d{V) + d(V) + c{W) 



we conclude from Proposition 5.21 that d(A) < 1. Actually we can do better: 
Lemma 5.23. d(A) = 0. 



Proof. Suppose for contradiction that d(A) = 1; without loss of generality we may 
take 11222 G A. This implies that d(l * * * *) = d(*l * **) = 1. Also, by the 
above discussion, c(* * 1 * *) and c(* * 3 * *) cannot both be 24, so by Proposition 



5.21, s(* *!**) + s(* * 3 * *) < 1/3; similarly s(* * *1*) + s(* * *3*) < 1/3 and 



s{* * * * 1) + s(* * * * 3) < 1/3. Since the s average to zero, we see from the pigeonhole 
principle that either s(l ****) + s(3 * * * *) > —1/2 or s(*l * **) + s(*3 * **) > —1/2. 
We may assume by symmetry that 

s(l ****) + s(3 ****)> —1/2. (5.25) 



Since s(3 ****)< 1/2 by Proposition 5.21, we conclude that 

s(l****)>-l. (5.26) 
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If \A PI 1 * * * *| = 41, then by (5.23) we have 
1 



s(l****j 



I — -(48— 2c(l****)— 3d(l****))— — (96— 36(1****)— 2c(l****)— 3d(l****)) 



2c(l * * * *) 

42 (by Proposition 5. 14 ^ii) and ( |5.18[ )). In that case, from (5.22) we 



but the arguments in Proposition |5.21 give 48 — 2c(l ****) — 3d(l * * * *) > 3 and 
96 — 36(1 ****) — 2c(l ****) — 3d(l ****)> 0, a contradiction. So we must have 

|yi n l * * * *| 

have 

s(l****) = -(48— 2c(l ****) — 3g?(1** **)) — — (96— 36(1****)— 2c(l ****) — 3d(l ****)) 

while also having 48 — 2c(l ****) — 3d(l ****)> 3 and 96 — 36(1 ****) — 2c(l ****) — 
3d(l ****)> 0. Since s(l ****)> —1 and d(l ****) = 1, we soon see that we must 
have 48 — 2c(l****) — 3d(l****) = 3 and 96 — 36(1****) — 2c(l****) — 3d(l****) < 3, 
which forces c(l * * * *) = 21 and 6(1 ****) = 16 or 6(1 ****) = 17; thus the statistics 
of 1 * * * * are either (4, 16, 21, 1, 0) or (3, 17, 21, 1, 0). 

We first eliminate the (3,17,21,1,0) case. In this case s(l * * * *) is exactly — 1. 
Inspecting the proof of (5.26), we conclude that s(3 * * * *) must be +1/2 and that 
s(* * 1 * *) + s{* * 3 * *) = 1/3. From the former fact and Proposition 5.21 we see that 
a(A) = a(l ****) + a(3 ****) = 3 + 3 = 6; on the other hand, from the latter fact and 
Proposition 5.21 we have a(A) = a(* *!**) + a(* *3**) = 4 + 3 = 7, a contradiction. 



So 1 * * * * has statistics (4,16,21,1,0), which implies that s(l * * * *) = —3/4 and 
\An 1 * * * *| =42. By (5.25) we conclude 

s(3 ****)> 1/4, (5.27) 

which by Proposition 5.21 implies that |A H3 * * * *| = 43, and hence |An 2 * * * *| = 40. 
On the other hand, since e(A) = f(A) = and d(A) = 1, with the latter being caused 



by 11222, we see that c(2 ****) = d(2 ****)= e(2 ****)= 0. From (5.2) we have 



4a(2****) + 6(2****) < 64, and we also have the trivial inequality 6(2****) < 32; these 
inequalities are only compatible if 2 * * * * has statistics (8, 32, 0, 0, 0), thus A fl 2 * * * * 
contains S% 5 D 2 * * * *. 



If a(3 ****)= 4, then a(A) = a(l ****) + a(3 ****) = 8, which by Proposition 5.21 



implies that s(* * 1 * *) + s(* * 3 * *) cannot exceed 1/12, and similarly for permutations. 
On the other hand, from Proposition 5.21 s(* * 1 * *) + s(* * 3 * *) cannot exceed 
—3/4 + 1/4 = —1/2, and so the average value of s cannot be zero, a contradiction. 
Thus a (3 * * * *) 7^ 4, which by (5.27) and Proposition 5.21 implies that * * 3 * * has 
statistics (3,16,24,0,0). 



In particular, A contains 16 points from 3 * * * * fl Si^ and all of 3 * * * * fl S^s- As a 
consequence, no pair of the 16 points in An3****nSi 5 can differ in only one coordinate; 
partitioning the 32-point set 3 * * * * fl S^s into 16 such pairs, we conclude that every 
such pair contains exactly one element of A. We conclude that y4fl3****n Si t s is 
equal to either 3****flS'i 5 or3****nS'° 5 . 

On the other hand, A contains all of 2 * * * * fl 6*2,5, and exactly sixteen points from 
1 * * * * n Sip. Considering the vertical lines *xyzw where xyzw G Si^, we conclude 



40 



D.H.J. POLYMATH 



that v4nl****n 51,5 is either equal to l****nS° 5 or 1 * * * * n S{ 5 . But either case 
is incompatible with the fact that A contains 11222 (consider either the line llxx2 or 
llxx~2, where x — 1, 2, 3 and x = 4 — x), obtaining the required contradiction. □ 

We can now eliminate all but three cases for the statistics of A: 

Proposition 5.24 (Statistics of A). The statistics (a(A),b(A),c(A),d(A),e(A), f(A)) 
of A must be one of the following three tuples: 

• (Case 1) (6,40,79,0,0); 

• (Case 2) (7,40,78,0,0); 

• (Case 3) (8,39,78,0,0). 



Proof. Since d(A) 



f(A) 



e(A) 
c(2 * * * * 



= 0, we have 
d{2 



e(2 



* * * * , 



0. 



On the other hand, from (5.2) we have 4a(2 ****) + 6(2 * * * *) < 64 as well as the 
trivial inequality 6(2 ****)< 24, and also we have 

\A n 2 * * * *| = 125 - \A n 1 * * * *| - \A n 3 * * * *| > 125 - 43 - 43 = 39. 

Putting all this together, we see that the only possible statistics for 2 * * * * are 
(8, 32, 0, 0, 0), (7, 32, 0, 0, 0), or (8, 31, 0, 0, 0). In particular, 7 < a(2 * * * *) < 8 and 
31 < b(2****) < 32, and similarly for permutations. Applying Lemma 5.5 we conclude 
that 

35 < b{A) < 40 

and 

77.5 < c(A) < 80. 

Combining this with the first part of Corollary 5.20 we conclude that c(A) is either 78 
or 79. From this and (5.24) we see that the only cases that remain to be eliminated 
are (7, 39, 79, 0, 0) and (8, 38, 79, 0, 0), but these cases are incompatible with the second 
part of Corollary |5.20| □ 



We now eliminate each of the three remaining cases in turn. 



5.5. Elimination of (6,40,79,0,0). Here A fl S^s has six points. By Lemma 5.19 



there are at least two pairs in this set which differ in two positions. Their midpoints 
are eliminated from A fl S^. But A omits exactly one point from S^^, so these mid- 
points must be the same. By symmetry, we may then assume that these two pairs are 
(11111,11133) and (11113,11131). Thus the eliminated point in S 3)5 is 11122, i.e. A 
contains 5 , 3 i 5\{11122}. Also, A contains {11111,11133,11113,11131} and thus must 
omit {11121, 11123, 11112, 11132}. 



Since 11322 G A, at most one of 11312, 11332 lie in A. By symmetry we may assume 
11312 ^ A, thus there is a pair (xylz2, xy3z2) with x, y, z — 1, 3 that is totally omitted 
from A, namely (11112, 11312). On the other hand, every other pair of this form can 
have at most one point in the A, thus there are at most seven points in A of the form 
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xyzw2 with x,y,z,w = 1,3. Similarly there are at most 8 points of the form xyz2w, or 
of xy2zw, x2yzw, 2xyzw, leading to b(A) <7 + 8 + 8 + 8 + 8 = 39, contradicting the 
statistic b(A) = 40. 



5.6. Elimination of (7,40,78,0,0). Here AnS^^ has seven points. By Lemma 5.19 
there are at least three pairs in this set which differ in two positions. As we can only elim- 
inate two points from S 3t5 , two of the midpoints of these pairs must be the same; thus, 
as in the previous section, we may assume that A contains {11111, 11133, 11113, 11131} 
and omits {11121, 11123, 11112, 11132} and 11122. 

Now consider the 160 lines I connecting two points in S 4)5 to one point in S^s (i.e. 
*2xyz and permutations, where x,y,z = 1,3). By double counting, the total sum of 
\£ n A\ over all 160 lines is Ab(A) + 2c(A) = 316 = 158 x 2. On the other hand, each of 
these lines contain at most two points in A, but two of them (namely 1112* and 1112*) 
contain no points. Thus we must have \£ n A\ =2 for the remaining 158 lines I. 

Since A omits 1112a; and 111x2 for x — 1,3, we thus conclude (by considering the lines 
11 * 2x and 11 * x2) that A must contain 1132a;, 113x2, 1312a;, and 131x2. Taking 
midpoints, we conclude that A omits 11322 and 13122. But together with 11122 this 
implies that at least three points are missing from AnS^, contradicting the hypothesis 
c(A) = 78. 



5.7. Elimination of (8,39,78,0,0). Now A fl S5 5 has eight points. By Lemma 5.19 



there are at least three pairs in this set which differ in two positions. As we can 
only eliminate two points from S 3t5 , two of these pairs (a, b), (c,d) must have the same 
midpoint p, and two other pairs (a', b'), (c', d!) must have the same midpoint p', and A 
contains S3^\{p,p'}. As p, p' are distinct, the plane containing a,b,c,d is distinct from 
the plane containing a', b', c', d'. 



Again consider the 160 lines I from the previous section. This time, the sum of the 
\£ H A\ is 4b(A) + 2c(A) = 312 = 156 x 2. But the two lines in the plane of a,b,c,d 
passing through p, and the two lines in the plane of a', b', c', d' passing through p', have 
no points; thus we must have \£ H A\ =2 for the remaining 156 lines I. 

Without loss of generality we have (a, b) = (11111, 11133), (c, d) = (11113, 11131), thus 
p = 11122. By permuting the first three indices, we may assume that p' is not of 
the form x2y2z,x2yz2,xy22z,xy2z2 for any x,y,z = 1,3. Then we have 1112x ^ A 
and 1122x G A for every x = 1,3, so by the preceding paragraph we have 1132x G A; 
similarly for 113x2, 1312x, 131x2. Taking midpoints, this implies that 13122, 11322 ^ A, 
but this (together with 11122) shows that at least three points are missing from AdS^^, 
contradicting the hypothesis c(A) = 78. 



5.8. Six dimensions. Now we establish the bound Cg 3 = 353. In view of the lower 
bounds, it suffices to show that there does not exist a Moser set AG [3] 5 with \A\ — 354. 
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We argue by contradiction. Let A be as above, and let (a(A), . . . , g(A)) be the statistics 
of A. 

Lemma 5.25. g(A) = 0. 



Proof. For any four-dimensional slice V of A, define 

S(V) := 15a(V) + 56(F) + 5c(V)/2 + 3d(V)/2 + e(V). 



From Lemma 5.5 we see that |A| is equal to a(A) + b(A) plus the average of S(V) where 



V ranges over the twenty slices which are some permutation of the center slice 22 * * * *. 



If g(A) = 1, then a(A) < 32 and b(A) < 96 by (5.2). Meanwhile, e(V) = <?(A) = 1 for 



every center slice V, so from Lemma [5.13 , one can show that S(V) < 223.5 for every 
such slice. We conclude that \A\ < 351.5, a contradiction. □ 



For any four-dimensional slice V of A, define the defects to be 

D(V) := 356 - [4a(V) + Qb(V) + 10c(V) + 20d(V) + 60e(V)\. 

Define a corner slice to be one of the permutations or reflections of 11 * * * *, thus there 
are 60 corner slices. From Lemma 5.5 we see that 356 — \A\ + f{A) = 2 + f(A) is the 
average of the defects of all the 60 corner slices. On the other hand, from Lemma 5.13 
and a straightforward computation, one concludes 

Lemma 5.26. Let A be a four- dimensional Moser set. Then D(A) > 0. Furthermore: 



• If A has statistics (6, 12, 18,4,0), then D(A) = 0. 

• If A has statistics (5, 12, 18, 4, 0), (5, 12, 12, 4, 1), or (6, 8, 12, 8, 0), then D(A) = 
4. 

• For all other A, D(A) > 6. 

• Ifa(A) = A, then D(A) > 8. 

• If a (A) > 7, then D(A) > 16 (with equality iff A has statistics (7,11,12,6,0)^. 

• Ifa(A) > 8, then D(A) > 30. 

• Ifa(A) > 9, then D(A) > 86. 



Define a family to be a set of four parallel corner slices, thus there are 15 families, which 
are all a permutation of {11****, 13****, 31****, 33 ****}. We refer to the family 
{11 * * * 13 * * * *, 31 * * * *, 33 * * * *} as a6 * * * *, and similarly define the family 
a * b * **, etc. 

Lemma 5.27. f(A) = 0. 



Proof. For any four-dimensional slice V of A, define 

s(V) := 12a(V) + 156(V)/2 + 20c(V)/3 + 15d(V)/2 + 12e(V), 

and define an edge slice to be one of the 60 permutations or reflections of 12 * * * *. 
From double counting we see that \A\ — a(A) is equal to the average of the 60 values of 
s(V) as V ranges over edge slices. 
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From Lemma 5.13 one can verify that s(V) < 336, and that s(V) < 296 = 336 — 40 if 



eiy) = 1. The number of edge slices V for which e(V) = 1 is equal to 5f(A), and so 
the average value of the s(V) is at most 336 — and so 



"60 



\A\ - a{A) < 336 - ^— -f(A) 



which we can rearrange (using \A\ — 354) as 

a(A)>18+™f(A). 

Suppose first that f(A) > 3; then a(A) > 28. Then in any given family, there is a 
corner slice with an a value at least 9, or four slices with a value at least 7, or two slices 
with a value at least 8, or one slice with a value 8 and two of a value at least 7, leading 
to a total defect of at least 60 by Lemma |5.26[ Thus the average defect is at least 15; 
on the other hand, the average defect is 2 + f(A) < 2 + 12, a contradiction. 

Now suppose that f(A) = 2; then a(A) > 25. This means that in any given family, 



one of the four corner slices has an a value of at least 7, and thus by Lemma 5.26 has 



a defect of at least 16. Thus the average defect is at least 4; on the other hand, the 



average defect is 2 + f(A) = 4. From Lemma 5.26, this implies that in any given family, 
three of the corner slices have statistics (6,12,18,4,0) and the last one has statistics 
(7, 11, 12, 6, 0). But this forces b(A) = 70.5 by double counting, which is absurd. 

The remaining case is when f(A) = 1. Here we need a different argument. Without 
loss of generality we may take 122222 G A. The average defect among all 60 slices is 
2 + f(A) = 3. Equivalently, the average defect among all 15 families is 12. 

First suppose that a(A) ^ 24. Then in every family, at least one of the corner slices 
needs to have an a value distinct from six, and so the average defect in each family is 
at least 4. Thus the five families ab****,a*b***,a**b**,a***b*,a****b have 
an average defect of at most 28, which implies that the ten corner slices beginning with 
1 (or equivalently, adjacent to an edge slice containing 122222) is at most 14. In other 
words, if (a, b, c, d, e) is the average of the statistics of these ten corner slices, then 

4a + 66 + 10c + 20d + 60e > 342. 

On the other hand, (a, 6, c, d, e) must lie in the convex hull of the statistics of four- 



dimensional Moser sets, which are described by Lemma 5.10 Also, as A contains 



122222, one has c/24,d/8,e < 1/2 by considering lines with centre 122222. Finally, 



from (5.8) and double-counting one has 7c/24 + 3d/8 + 3e + 1 < 6. Inserting these 
facts into a standard linear program yields a contradiction; indeed, the maximal value 
of 4a + 66 + 10c + 20d + 60e with these constraints is 338 1, attained when (a, b, c, d, e) = 



f, 16,12,4,|; 



Finally, we consider the case when f(A) = 1 and a (A) = 24. The preceding arguments 
allow the average defect of the ten corner slices beginning with 1 to be as large as 18, 
which implies that 4a + 66 + 10c+20d + 60e > 338. Linear programming shows that this 
is not possible if a > 6, thus a < 6. But this forces one of the corner slices beginning 
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with 3 to have an a value of at least 7, and thus to have a defect of at least 16 by 
Lemma |5.10| Repeating the preceding arguments, this increases the lower bound for 



4a + 6b + 10c + 20d + 60e by f , to 339.6; but this is now inconsistent with the upper 
bound of 338 1 from linear programming. □ 



As a consequence of the above lemma, we see that the average defect of all corner slices 
is 2, or equivalently that the total defect of these slices is 120. 

Call a corner slice good if it has statistics (6, 12, 18, 4, 0), and bad otherwise. Thus good 
slices have zero defect, and bad slices have defect at least four. Since the average defect 
of the 60 corner slices is 2, there are at least 30 good slices. 

One can describe the structure of the good slices completely: 

Lemma 5.28. The subset of[3] A consisting of the strings 1111, 1113, 3333, 1332, 1322, 
1222, 3322 and permutations is a Moser set with statistics (6, 12, 18,4,0). Conversely, 
every Moser set with statistics (6, 12, 18, 4, 0) is of this form up to the symmetries of the 
cube [3] 4 . 



Proof. This can be verified by computer. By symmetry, one assumes 1222,2122,2212 
and 2221 are in the set. Then 18 of the 24 'c' points with two 2s must be included; 
it is quick to check that 1122 and permutations must be the six excluded. Next, one 
checks that the only possible set of six 'a' points with no 2s is 1111, 1113, 3333 and 
permutations. Lastly, in a rather longer computation, one finds there is only possible 
set of twelve '6' points, that is points with one 2. A computer-free proof can be found 
at the page Classification of (6,12,18,4,0) sets at [21]. □ 



As a consequence of this lemma, given any x,y,z,w G {1,3}, there is a unique good 
Moser set in [3] 4 set whose intersection with Si^ is {x222, 2y22, 22z2, 222w}, and these 
are the only 16 possibilities. Call this set the good set of type xyzw. It consists of 

• The four points a;222, 2y22, 22z2, 222w in S 1A ; 

• All 24 elements of £2,4 except for xy22, x2z2, x22w, 2yz2, 2y2w, 22zw; 

• The twelve points xYZ2, xY2W, x2ZW, XyZ2, Xy2W, 2yZW, XYz2, X2zW, 
2YzW, XY2w, X2Zw, 2YZw in S 3}4 , where X = A-x, Y = A-y, Z = A-z, 
W = 4-w; 

• The six points xyzw, xyzW, xyZw, xYzw, Xyzw, XYZW in £4,4. 

We can use this to constrain the types of two intersecting good slices: 

Lemma 5.29. Suppose that the pq**** slice is of type xyzw, and the p*r*** slice is 
of type x'y'z'w' , where p, q, r, x, y, z, w, x' , y' , z' , w' are in {1, 3}. Then x' = x iff q — r, 
and y'z'w' is equal to either yzw or YZW. If x = r (or equivalently if x' = q), then 
y'z'w' = yzw. 
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Proof. By reflection symmetry we can take p = q = r = 1. Observe that the 11 * * * * 
slice contains 111222 iff x — 1, and the 1*1*** slice similarly contains 111222 iff x' = 1. 
This shows that x — x' . 

Suppose now that x = x' = 1. Then the 111 * ** slice contains the three elements 
llly22, 1112,22, 11122™, and excludes 111Y22, 1112Z2, 11122W, and similarly with the 
primes, which forces yzw = y'z'w' as claimed. 

Now suppose that x = x' = 3. Then the 111 * ** slice contains the two elements 
lllyzw, 111YZW, but does not contain any of the other six points in S^q H 111 * **, 
and similarly for the primes. Thus y'z'w' is equal to either yzw or YZW as claimed. □ 



Now we look at two adjacent parallel good slices, such as 11 * * * * and 13 * * * *. The 
following lemma asserts that such slices either have opposite type, or else will create a 
huge amount of defect in other slices: 

Lemma 5.30. Suppose that the 11 * * * * and 13 * * * * slices are good with types xyzw 
and x 'y'z'w' respectively. If x = x' , then the 1 * x * ** slice has defect at least 30, and 
the 1 *X * ** slice has defect at least 8. Also, the 1**1**, 1**3** ; 1***1*, 1* * *3* ; 
1****1 ; 1****3 slices have defect at least 6. In particular, the total defect of slices 
beginning with 1* is at least 74. 



Proof. Observe from the 11****, 13**** hypotheses that a(l * x * **) = 9 and 
a(l * X * **) = 4, which gives the first two claims by Lemma 5.261 For the other claims, 



one sees from Lemma 5.29 that the other six slices cannot be good; also, they have 
an a-value of 6 and a <i-value of at most 7, and the claims then follow from Lemma 
EM □ 



Now we look at two diagonally opposite parallel good slices, such as 11 * * * * and 
33 *h ^ *fc *^ . 

Lemma 5.31. The 11 * * * * and 33 * * * * slices cannot both be good and of the same 
type. 



Proof. Suppose not. By symmetry we may assume that 1 1 * * * * and 33 * * * * are of type 
1111. This excludes a lot of points from 22 * * * *. Indeed, by connecting lines between 
the 11**** and 33**** slices, we see that the only points that can still survive in 22**** 
are 221133,221333,221132,223332, and permutations of the last four indices. Double 
counting the lines 22133* and permutations we see that there are at most 12 points one 
can place in the permutations of 221133, 221333, 221132, and so the 22 * * * * slice has 
at most 16 points. Meanwhile, the two five-dimensional slices 1 * * * **, 3 * * * ** have 
at most c' 5 3 = 124 points, and the other two four-dimensional slices 21****, 23**** 
have at most c' 4 3 = 43 points, leading to at most 16 + 124 * 2 + 43 * 2 = 350 points in 
all, a contradiction. □ 

Lemma 5.32. It is not possible for all four slices in a family to be good. 
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Proof. Suppose not. By symmetry we may assume that 11****, 13****, 31****, 33**** 



are good. By Lemma 5.31 , the 11 * * * * and 33 * * * * slices cannot be of the same 
type, and so they cannot both be of the opposite type to either 13 * * * * or 31 * * * *. 
If 13 * * * * is not of the opposite type to 11 * * * *, then by (a permutation of) Lemma 



5.30, the total defect of slices beginning with 1* is at least 74; otherwise, if 13 * * * * is 



not of the opposite type to 33 * * * *, then by (a permutation and reflection of) Lemma 



5.30 , the total defect of slices beginning with *3 is at least 74. Similarly, the total defect 
of slices beginning with 3* or *1 is at least 74, leading to a total defect of at least 148. 
But the total defect of all the corner slices is 2 x 60 = 120, a contradiction. □ 



Corollary 5.33. At most one family can have a total defect of at least 38. 

Proof. Suppose there are two families with defect at least 38. The remaining thirteen 



families have defect at least 4 by Lemma 5.32 and Lemma 5.26, leading to a total defect 



of at least 38 * 2 + 13 * 4 = 128. But the total defect is 2 x 60 = 120, a contradiction. □ 
Actually we can refine this: 

Proposition 5.34. No family can have a total defect of at least 38. 

Proof. Suppose for contradiction that the ab * * * * family (say) had a total defect of at 



least 38, then by Corollary 5.33 no other families have total defect at least 38. 



We claim that the * * ab * * family can have at most two good slices. Indeed, suppose 



the * * ab * * has three good slices, say * * 11 * *, * * 13 * *, * * 33 * *. By Lemma [5.31 
the * * 11 * * and * * 33 * * slices cannot be of the same type, and so cannot both be of 
opposite type to ** 13* *. Suppose * * 11 * * and * * 13* * are not of opposite type. Then 



by (a permutation of) Lemma 5.30, one of the families a*6***, *a&***, **6*a*, **6**a 



has a net defect of at least 38, contradicting the normalisation. 

Thus each of the six families * * ab * *,* * a * b*,* * a * *b, * * *ab*, * * *a * b,* * * * ab 
have at least two bad slices. Meanwhile, the eight families a*6***,a**6**,a*** 
fr*, a * * * *b, *ab * **, *a * b * *, *a * *&*, *a * * * b have at least one bad slice by Corollary 



5.32, leading to twenty bad slices in addition to the defect of at least 38 arising from 



the ab * * * * slice. To add up to a total defect of 120, we conclude from Lemma 5.26 



that all bad slices outside of the ab * * * * family have a defect of four, with at most 



one exception; but then by Lemma 5.30 this shows that (for instance) the 1*1*** and 
1*3*** slices cannot be good unless they are of opposite type. The previous argument 
then shows that the a*b*** slice cannot have three good slices, which increases the 
number of bad slices outside of ab * * * * to at least twenty-one, and now there is no 
way to add up to 120, a contradiction. □ 

Corollary 5.35. Every family can have at most two good slices. 



Proof. If for instance 11****, 13****, 33**** are all good, then by Lemma 5.31 at 
least one of 11 * * * *, 33 * * * * is not of the opposite type to 13 * * * *, which by Lemma 
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5.30 implies that there is a family with a total defect of at least 38, contradicting the 
previous proposition. □ 



From this corollary and Lemma 5.26 we see that every family has a defect of at least 
8. Since there are 15 families, and 8 x 15 is exactly equal to 120, we conclude 

Corollary 5.36. Every family has exactly two good slices, and the remaining two 
slices have defect 4. In particular, by Lemma 5.26 , the bad slices must have statistics 
(5,12,18,4,0), (5,12,12,4,1), or (6,8,12,8,0). 



We now limit how these slices can interact with good slices. 
Lemma 5.37. Suppose that 1*1*** is a good slice. 

(i) The 11 * * * * slice cannot have statistics (6,8, 12,8,0). 

(ii) The 11 * * * * slice cannot have statistics (5, 12, 12,4, 1). 

(hi) // the 11 * * * * slice has statistics (5,12,18,4,0), then the 112 * ** slice has 
statistics (3, 9, 3, 0). 

Proof. This can be verified through computer search; there are 16 possible configura- 
tions for the good slices, and one can calculate that there are 27520 configurations for 
the (5,12,12,4,1) slices, 4368 configurations for the (5,12,18,4,0) slices, and 80000 
configurations for the (6, 8, 12, 8, 0) slices. It is then a routine matter to inspect by 
computer all the potential counterexamples to the above lemma. □ 

Corollary 5.38. The 111*** slice has statistics (4,3,3,1), (2,6,6,0), (3,3,3,1), or 
(1,6,6,0). 



Proof. From Corollary 5.36 we know that at least one of the slices 13****, 31*** 
*, 11 * * * * are good. If 11 * * * * or 1 * 1 * ** is good, then the slice 111 * ** has 



statistics (4, 3, 3, 1) or (2, 6, 6, 0), by Lemma 5.28. By symmetry we may thus reduce to 



the case where 13 * * * * is good and 1 * 1 * ** is bad. Then by Lemma 5.37, the 1*1*** 
slice has statistics (5,12,18,4,0) and the 121 * ** slice has statistics (3,9,3,0). Since 
the 131 * ** slice, as a side slice of the good 13 * * * * slice, has statistics (4, 3, 3, 1) or 
(2, 6, 6, 0), we conclude that the 111 * ** slice has statistics (1, 6, 6, 0) or (3, 3, 3, 1), and 
the claim follows. □ 

Corollary 5.39. All corner slices have statistics (6,12,18,4,0) or (5,12,18,4,0). 



Proof. Suppose first that a corner slice, say 11 * * * * has statistic (6, 8, 12, 8, 0). Then 
111 * ** and 113 * ** contain one "d" point each, and have six "a" points between 



them, so by Corollary 5.38, they both have statistic (3, 3, 3, 1). This forces the 1 * 1 * **, 



1*3*** slices to be bad, which by Corollary |5.36| forces the 3*1***, 3*3*** slices 
to be good. This forces the 311 * **,313 * ** slices to have statistics either (2,6,6,0) 
or (4,3,3,1). But the 311 * ** slice (say) cannot have statistic (4,3,3,1), since when 
combined with the (3,3,3,1) statistics of 111 * ** would give a(*ll * **) = 7, which 
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contradicts Corollary 5.36 thus the 311 * ** slice has statistic (2, 6, 6, 0), and similarly 
for 331 * **. But then a(3 * 1 * **) = 4, which again contradicts Corollary 5.36 



Thus no corner slice has statistic (6,8,12,8,0). Now suppose that a corner slice, say 
11**** has statistic (5, 12, 12, 4, 1). By Lemma 5.37, the 1*1***, 1*3*** slices are bad, 



so by repeating the preceding arguments we conclude that the 311***, 313*** slices 
have statistics (2, 6, 6, 0) or (4, 3, 3, 1); in particular, their a- value is even. However, the 



*11 * ** and *13 * ** slices are bad by Lemma [5.37[ and thus have an a- value of 5; thus 
the 111 * ** and 113 * ** slices have an odd a-value. Thus forces a(ll * * * *) to be even; 
but it is equal to 5, a contradiction. □ 



From this and Lemma [53 , we see that A has statistics (22, 72, 180, 80, 0, 0, 0). In par- 
ticular, we have 2a 2 (A) + a%(A) = 2, which by double counting (cf. ( |5.2 )) shows that 
for every line of the form 11122* (or a reflection or permutation thereof) intersects A 
in exactly two points. Note that such lines connect a "d" point to two "c" points. 



Also, we observe that two adjacent "d" points, such as 111222 and 113222, cannot both 
lie in A; for this would force the *13 * ** and *11 * ** slices to have statistics (4, 3, 3, 1) 
or (3, 3, 3, 1) by Corollary 5.38, which forces a(*l ****) = 6, and thus *1 * * * * must 
be good by Corollary 5.39; but this contradicts Lemma 5.28 Since 013(A) = 1/2, we 
conclude that given any two adjacent "d" points, exactly one of them lies in A. In 
particular, the d points of the form * * *222 consist either of those strings with an even 
number of Is, or those with an odd number of Is. 



Let's say it's the former, thus the set contains 111222, 133222, and permutations of the 
first three coordinates, but omits 113222,333222 and permutations of the first three 
coordinates. Since the "d" points 113222, 333222 are omitted, we conclude that the "c" 
points 113122, 113322, 333122, 333322 must lie in the set, and similarly for permutations 
of the first three and last three coordinates. But this gives at least 15 of the 16 "c" 
points ending in 22; by symmetry this leads to 225 c-points in all; but c(A) = 180, 
contradiction. This (finally!) completes the proof that c' 63 = 353. 
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